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This thesis is intended to halp non-statiticia 
nonparametric tests to thair fullas®. The text e 
when nonparamatric tests are preferabla to parametr s 
how to choose the most appropriate test, and how to use the 
nonparametric software package. 

Parametric tests are of+en the prafezred form of statis- 
tics because sf their acceptance in the academic world, the 
availability of computer support, and flexibility in model 
puslding. Nonparametric tests provide a valuable alterna- 
tive to parametrics, because they may be applied to data 
nich, for any one of a number of reasons, canno* be 
analyzed with parametric tests. The simplicity of nonparame- 


tric tests often make the test statistics easier *o computs 


and test results easier to explain. Section II gives a 
detailed comparison of parametric versus nonparametric 
tests. 


When confronted with the same hypothesis, various parta- 
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metric tests give consistent results because each е 
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extracts + ha same +n£>r oe 210n from the. data 
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Nonparametric tests may yield varyiagy results with the s 
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me 
data, because different tests extract different information 
h 


from the data set, TO overcome «this potential problen, + 


iD 


most appropriate nonparametric t2st should be chosen to 
analyze the data set. Section IV gives guidance on 
selecting the best test. 

In order to make using nonparanetric tests as easy as 
possible, +w programs were written to verform he most 
common 24 nonparametric +2sts. A general description of how 
Fo use the pragreas is contained in section ITI. Sections Y 


Espough XXVIII contain dascriptions of sach 5f the tests, 





oeta lsd instructions on their use. The appendices 


contain more detailed information about the programs. 
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Appendix A is a tutorial which depicts use of programs 
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describe the two programs and two files. Appendix G 
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to perform a test on a data Set. Appendices ТЫТ 
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contains the interactive Pascal program, Lochinvar, м 
Appendix F contains the Fortran program Crunch. 
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This section contrasts the nonparametric statistical 
test with parametric statistics. To do this, there isa 
short summary addressing various levels of measurement, and 
a brief discission on the relationship between the normal 
distribution and interval data. Finally there is a compar- 
ison of the advantages of the two classes of tests. This 
will help the reader decida whether to use the tests of this 
text. 


wee «LEVEES OF MEASUREMENT 


Statisics manipulate raw data t> extract whatever infor- 
mation that lata contains. Level >f measurement indicate 
how much information is contained ia the data, and how *hat 
information may be manipulated. Although formally there ars 
four levels of measurement, as defiaed by Stevens [RBRef. 2], 
there are only three which matter when choosing the test: 
nominal, ordinal, and interval. As a consistent rule any 
information contained in data of a lower level of measure- 
ment is contained in data 5f a higher level; any manipula- 
tion permitted on a lower level is permitted on a higher 
level. 

Nominal data elements contain vary little information: 
only membership in a subset of the total data set can ba 
determined. Manipulation of *he nominal data sets is 
limited to counting the number of elements in each subset. 

Ordinal data elements contain information relative +9 

ther elements: element A is either less than, greater than, 
or equal to Sach other element in the set. However, tha 
question "greater than by how much" canno+ be answered. The 


data set may be ranked. 
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Interval data elements contain  aores inforıaticn. NƏ £ 
nly can elenent A be compared to other elements, bu+ 
inequalities can be quantified; e.j., A>B implies that «here 
exists a k such that A=Btk, and that the k can be found by 
К=А-В. The lata set can be manipulateä with normal arich- 
Metic Operations, such as addition, division, or raising to 
a power. 


B. LEVEL OF MEASUREMENT AND DISTRIBUTION ASSUMPTIONS 


The normal distribution is completely sp¢scified by two 
parameters: 2xpected value and variance. In estimating 
those parameters forma data set, arithmetic operations are 
used, and intervals are treated as though they were mean- 
inaful. For 2xample, the expected value is estimated by the 
sample mean, which is found by sumning the data elements and 
dividing by the number of elements. The variance is found 
by summing the squares of the int2rvals between each data 
elament and the sample mean, and dividing by the number of 
elements minus one. Because of th2 use of arithmetic opera- 
tions and int2rvals in computing its parameters, the normal 
distributon seems to require interval data sets. 

Parametric +ests assume that a data set contains 
elements with an interval level of measurement drawn from a 
normally distributed population. Comparison of two data 
sets usually require the assumption of equal variance in the 
two populations from which the sampl2s were drawn. There is 
a fair amount of literatura concerning how valid parametric 
tests are when any of these assumptions are violated; a 
large, vocal group of statisticians would say that such 
tests produce valid results even when the assumptions ars 
decidedly not met. But meeting all of the assumptions of 
parametric tests are the guarantee of validit when using 


such test. 


11 





Nonparametric tests make no assumptions about the under- 
hy ota aero Ono tho population from which the Cata iS 
drawn. Additionally, there ar2 nonparametric tests to 
analyze data with nominal level of measurement or higher. 
Because some nonparametric tests require only nominal or 
ordinal data, and don't make assunptions about population 
distributions, the power of thos2 tests may be less than 
that of paramatric tests-- assuming that the data set justi- 


fied use of parametrics in the first placz. 


C. ADVANTAGES AND DISADVANTAGES OF CLASSES OF TESTS 


The advantages and disadvantages of parametric tests are 
the mirror image of those of nonparanatric test. Therefore, 
the advantages of sach class cf tests is given. 

1. Advantages of Parametric T2sts 

The general advantages of parametric tests are wide 
acceptance of methodology, consistency of results, and capa- 
Duty to handle multiple variable in systematic model 
Barıidıng. 

a. Wile Acceptance 

The wide acceptance of parametric test has a 
number of advantages. Statistical software packages contain 
a very complete set of parametric tests, and only a small--- 
although increasing over time--number of nonparametric 
tests. Table of the +haoretical distributions needed to 
judge test statistics (the normal, student's t, chi-square 
and F) are extensive and ubiquitous. The terms of the test 
are known to a majority cf persons trained in the social or 
physical sciences; whether the tecms are fully understood 


may not influence the value of their racognition. 
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b. Consistency of Results 

Different parametric tests give the same conclu- 
sions, given the same null and alternative hypotheses. 
Parametric tests all extract the same information from the 
data, and merely rearrange it for different purposes. The 
parametric distributions are all derived from “the normal.*' 
With related test statistics and similar test distributions 
for critical values, itis not surprising that different 
techniques give consistent answers to the same question. 

Nonparametrics may provide different results to 
the same question on the same data sat, because different 
tests extract different types of information from the data 
set. For example, with paired data elements, the sign test 
considers only which member of tha pair is larger; the 
Signed rank test needs to know not only which is larger, but 
also the maanitude of that difference. Consistency of 
results is therefor a possible advantage for parametric 
tests. However, this advantage can be minimized by 
selecting the best (most powerful) nonparametric test 
possible, as discussed in à later section. 

C. Versatility in Model Building 

The final, and probably most important, advantage 
cf  parametrics is their versatility in explaning very 
complex interactions in data. Analysis of variance can test 
differences in treatment effects, multiple blocking effects, 
and interaction among treatments and blockings. Regression 
can be made to do analysis of variance, (by using nothing 
but dichotomous variables), but is even more flexible, by 
building models using variables with continuous values 
rather than only a fixed number of treatment levels. 


! The t is a normal, discribution in which the lack of 
knowledge of the population expected vaiue is corrected for 
by lowered kurtosis; the chi-squars distribution of degrees 
of freedom nis the sum of squares of n standard normal 
distributions; the F is the ratio of two chi-squares. 
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This versatility of paranetrics is based on the 
very strong assumptions made about the populations from 
which the data is drawn: literature beyond the elementary 
statistics textbook level contains many ways to determine if 
the assumptions are met, and what to do when they are not 
net. 

Nonparametric tests, on tha other hand, assume 
nothing about the underlying distribution, and often work 
with ncminal or ordinal data. Nos surprisingly, only very 
straight-forward answers can be given for any one test. A 
clever application of a sequence of tests to the same data 
set may, however, yield answers to a number of different 
questions, and provide some inferences on interaction. 


2. Advantages of Nonparametric Tests 


Emm Wr == 





The advantages of nonparamatric tests are simplicity 
and applicability. 
ae ol pl icity 

Nonparametric tests' simplicity allows for rela- 
tively easy calculaticn of the test statistic and critical 
value for a null hypothesis. This simplicity also allows 
nonparametric tests to be explainsi то, and defended in 
front of, persons who are not trained in statistics. 

The test statistic for most nonparametic tests 
can be calculated by hand. For example, a set of 20 pairs 
of data elements (X,Y) can be compared for equality of 
expected value (E(X)=E(Y)) by either the t-test or the 
nonparametric sign test. The t-test needs sample means and 
variances, so some arithmetic operations are required: four 
sums of 20 terms each, 40 squaring of numbers, four divi- 
sions. In contrast, the sign test requires counting in how 
many pairs «he X element is greater than the Y slement. The 
contrast is not always so one sided; as the amount of infor- 
nation extracted from the data incr23asées, so does complexity 


pee ouputatccon. for example, rank-ordar nonparametric tests 
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require ranking of data elements, which can become tedious 
in large data sets. However, the process of ranking tends 
to be self-correcting, because an unranked or misranked data 
value will stand out ina simple sequential check of the 
ranks. Arithnetic mistakes in squaring, summing, et cetera, 
May not be so obvious unless the operation is redone. When 
computations are made by hand, therefore, nonparametric test 
statisistics are easier +o find. 

Since hand-held calculators are even more ubiqui- 
tous than parametric tables, this advantage of nonparametric 
test is less important than it once was. £ the number of 
keystroke errors is proportional to che number of numbers 
entered in the calculator, though, the simplicity of nonpar- 
ametric tests may still be an advantage. 

Simplicity could also be an advantage in the rare 


case when tables of distribution are not available. Many 
nonparametric test distributions, especially the simpler 
tests, rest on the binomial distribution. For small sample 


sizes, the binomial can be calculated by hand; with a small 
calculator, a distribution for sample size of 20 or 30 can 
easily be calculated. The normal, student's t, chi-square 
and F distributions lack closed forms for their cumulative 
distribution functions. Numerical integration--even if the 
density functions were remembered or available--would be 
ehallenging with a calculator (since all but the normal 
require one or more gamma functions) and probably is beyond 
the realm of paper and pencil mathematics. 

Simplicity is also a virtu2 in explaining test 
results. Comparison and rankings are fairly easy to follow 
(although sometimes difficult to perform), while the arith- 
Metic operations of param2tric tests are less transparent. 
Most non-statisticians have had experience in tossing coins, 
rolling dice or some games of chance. Analogies can be made 
from those experiences to the evaluation of test results. 
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In the previous example of 20 pairs (X,Y), with the sign 
paccmDroduegng a test Statistic of 4 (1.2., in four pairs, 
X>Y), an analogy could ba made to the odds that a coin were 
fair if it came up heads four times in 20. Explaining that 
same data set with the t-test would require mention of stan- 
dard deviations and fooled variances, and perhaps drawing of 
overlapping bell-shaped curves. An audience predisposed to 
disagree with the test results might be hard to convince 
with parametric tests. 
ps Applicability 

Nonparametric tests can be used on a larger 
number of data set than can parametic tests, because of the 
lack of assumptions about the distribution of the underlying 
populaticn, and the ability to handle ordinal or interval 
data. This advantage is more vital in the "real world" than 
in academia, for three general reasons: 

(1) An analyst is sometimes handed data sets to 
do something with; for any one of a number of reasons, para- 
metric tests may not applicable. Most commonly, the form in 
which the data was ccllected in may fail to meet the under- 
lying requirements of parametrics, but meets the require- 
ments of nonparametrics. 

(2) The cost of gathering intervai data is 
prohibitive, while ordinal or nominal data can be gathered 
at low ccst. For example, on a field exercise, uncalibrated 
eyes can generally judge distances greater than or less 
than, but some sort of instrumentation must be used to 
determine the exact magnitude of the difference. The expen- 
Sively equipped national test ranges are a response to the 
need for very accurate interval level of measurement data. 
masoratories, too, ame often equipped for fine metrics. But 
in field or operational tests, ordinal level of measurement 
may be the highest level available. 
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(3) The experiment was designed to gather 
interval data, from populations with common variance, but 
something went wrong. Instrumentation sometimes falls. 
Even if the measurement is accurate, variances may not be 
common or the data may not be transformable to look normal. 
Rather than giving up, or repeating the experiment,  nonpar- 
ametric tests may answer some questions originally to be 


answered by analysis of variance or regression. 
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A. GENERAL 


Minze sect Omn gives 3 brief  desctipticn cf “hs computer 
package. Tests are described in sections V through XXVIII, 
while the Appendices describe thea programs and files in 
detail. 

The computer package consists эб two computer programs, 
Lochinvar and Crunch. Also involved are threa types of CMS 
files: data files and an options file, written by Lochinvar 
to be read by Crunch; and output files, written by Crunch, 


containing test results. 


B. LOCHINVAR 


The program Lochinvar (LOad CHoic=s and INput VARiablss) 
is the user's main contact with the programs. It is inter- 
active, prompting and screening tha user's entries. tes 
purpose is to writ2 two types of files needed by the program 
Gainch: the™data file, and the option file. 

1. Data Entry 
Lochinvar allows +the user +9 input data in two sepa- 
rate ways. The user must decide which one to use before 
starting the program. 
a. Method I: Data Values into a Data File 
When the data is numerical, the user should 
strongly consider this method. Lochinvar prompts entry of 
data set and variable names, and of the data set values 
themselves. All that information is sent to a file on the 
user's A disk; Muceussrashooses 312 9f Six files, "loch! 


Ee s; "Lochi E", into which the data is written. Unless 
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erased or ovar-written, the data will be available for 
repeated use. 

All 24 of the data tests in this text can compute 
test statistics from dat2 files. However, a data file 
itself is not enough to perform atest; the option file is 
needed to reference the data file and communicate the choice 
of test and test parameters to Crunch. After a data file i 
written, Lochinvar prompts the writing of the options file, 
as described in subsection 2, below. 

b. Method II: Cell Counts in thə Options File 

When the data 1S non-numerical, this is the user's 
only option; 1% шау be more convenient even if the data is 
numerical. Four of the nonparamstric tests handle nominal 
data--binomial, McNemar, chi-square test for independence, 
and Cochran. Instead of antering all the data values, the 
user may run one of these tests by entering the appropriate 
count of elements which fall ints 2 test's subcategories. 
For example, the binomial test can ba run on a large data 
set by simply entering two numbers: the count of "successes" 
and the count of "failures". 

In this method of data entry, Lochinvar prompts 


entry of cell counts at the same time irt is prompting the 


test selection and choice of parameter. All that informa- 
fron is written to the options file; no data file is 
written. 


2. Ihe Options File 

The purpose of the options file is to tell Crunch 
everything it needs to know to perform a nonparametric test. 
In all cases, the options file contains the user's selection 
of which test to be performed, the parameters to be used in 
Bat test, and to which CMS output fila the output should be 
Written. If the data values are t5 be used (Method I), the 
cption file states which data file Crunch should read. I 
cell counts are used (Method II), the optiors file contains 


those counts, as well as the data set name. 
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The options file is described in dətail, with an 
example, in Appendix D. 
3. Using Lochinvar 
In euer, суре "LOCHI". The program will displaya 
sequence of questions; the user's response will depend on 
the method of data enty and test salection. 
a. Use of Previously Enter Data 

The program asks "do you wish to perform a test 
on a previously entered data set?", i.e., one of the six CMS 
ВЕ 7 тесеиосн а s ...," loch» f. 

If the answer is yes, then Lochinvar asks which 
data set to use. The program reads the data set's name, and 
displays it to the user, and asks "Is this the right data 
set?" If yes, the user indicates so, and the test selection 
is next. If no, the the user must choose another file. 

If the answer to the quastion of previously enter 
data is no, then Lochinvar asks whether data entry is by 
data values (Method I) or cell counts (Method II). If the 
user wants to enter cell counts, che next step is to select 
a test. If the user wants to enter a data set, the program 
asks: 

(1) To which file the input data is to be written 

(2) What is the data set name, number of vari- 
ables and variable names. 

The program will then prompt entry of data 
values. If there are two variables in the data set, che 
user will be offered the choice of entering data pair by 
pair; this will only work if there are an equal number of 
elements in each of the two variables. Normally, all the 
data elements of the first variable will be entered 
together, then all of the second variable, e+ cetera. After 
entry of each variable, Lochinvar will offer the user the 
Chance to check, change or add data elements to the values 


just entered. 


20 





After the data set's information has been entered, 
ШЕЕ пуат writes it to the specified file. werting Of the 
options file is next. 

b. Selection of the Test to be Performed 

After the choice of data entry has been given, 
Lochinvar offers a menu of tests from which the user may 
choose. If call counts was the chosen method of data entry, 
four tests are offered. If a data file is to be used, the 
choice set of test offered is determined by: 

(1) the number of variables in the data set 

(2) whether the varibles have an equal number o£ 
elements 

Lochinvar will not offer a test unless the data 
set contains at least the minimum number of variables needed 
to perform the test. 

Some nonparametric tests require that data 


elements of one variable be related to corresponding 
elements cf other variables,  i.e., that the data set be 
paired or blocked. Lochinvar will ‘not offer such a test 


unless there are the same number of elements in each of the 
data set's variables. However, variables can be equal sized 
but not paired or blocked. The user should not choose an 
inappropriate test just because Lochinvar offers it. 
C Output File Selection 
The user chooses among six possible output files, 
Sind"... YZ listing". The files will be written to 
user's A disk after execution of the program Crunch. 
Previous contents of the designated file will be overwritten 
after Crunch runs. 
d. Test Parameter Entries 
Each test has different entries required. See the 
section on the desired test, sections V through XXVIII for 
specifics. 
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C. CRUNCH 


ter the options file has been written, the user 
executes the test by typing "CRUNCH" while in CMS. 
2. Actions of the Program Crunch 
Crunch reads the options file, and when necessary, 
the designated data file. It performs the nonparametric 
test requested, and reminds the user with a display on the 
Screen to which output file the results have been written. 
Crunch actually writes three files: 
a. The Output File 
rRe ou pür files are nan: "u listing", ..., "Z 
eting". At the top of of these file is the label, 
consisting of the name of the data sat, the number of vari- 
ables in the data set, the number of variables used in the 
test (if less than the number in the whole data set), and 
the names of the variables used in the test. Following the 
label is the name of the test performed. The rest of the 
output file's format varies, depending on the test chosen. 
In general, the output file contains only the most salient 
results of the computations. Al] output files contain 
either a test statistic, or a test statistic and the empir- 
ical level of significance for the iesired null hypothesis. 
b. The Problem Listing File 
If there are any problems encountered by the 
computer that result in Fortran error or warning messages, 
such messages will be written to the CMS file "problem 
i sting. 
C. The Nptest Listing File 
This file contains intermediate computations of 
the program as it works towards the test statistic. For 
example, the Smirncv test requires finding differences in 
probabilities of two empirical distributions. Since the 
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test statistic is simply the largest absolute difference, 
the output file will only list the largest positive and most 
negative difference, as well as the test statistic. "Nptest 
listing", among other things, will include each difference 
between the empirical cumulative distribution functions 
(cd£'s). This listing file is not arranged to be attractive 
or suitable for submission, but it is labelled, and may be 


of some use in understanding the operations of Crunch. 


D. ADDITIONAL FEATURES OF THE PACKAGE 


fo avoid SVeGgwEa ting input or output files, six of each 
were provided. If more are needed, the user can rename some 
of the existing files. 

Lochinvar is a convenient way to enter data bv hand. 
When data is generated from other programs or read from disk 
or tape, the user may want to create his own data file 
rather than use Lochinvar. Detalled instructions for this 
are contained in Appendix C. | 

Crunch is written in Fortran. Users who are experienced 
programmers may want to modify it to suit their own tastes, 
F adapt portions Sf it into their own programs. An expla- 
nation of the program is containad in Appendix E. The 
program is listed in Appendix G. 


E. CONSIDERATIONS IN DATA ENTRY 


Lochinvar is written in Pascal, a convenient language 
for interactive programs, but a language which may be unfa- 
miliar to many users. This section provides sufficient 
EHSIght to Lochinvar so «nat the user can efficiently commu- 


nicate choices tc the progran. 
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1. Entry of Numbers 
When entering numerical data, Lochinvar requires 
that the first character read is either a numeral or a minus 
sign followed by a numeral. Therefore, to enter a number 
between minus one and plus one, precede the decimal with a 
"Оч. 
When entering more than one number at a time, leave 
a space (not a comma) between the numbers. 
If Lochinvar expects a number and gets anything 
Eee, it will halt with an error. This is a convenient way 
to get out of the progran quickly, but anything entered on 
that run of Lochinvar will be lost. The one exception is 
when entering a data set of data values and an options file 
in a single run. After the data set is entered, the program 
writes it to the designated file, and asks "now, do a test 
with this data?" Any subsequent =rrors will not effect the 
data file itself, just erase all additional entries. 
PRSEENELrY/OE Chazaeters 


When the program pronpts for a single letter 


response, e.g., "enter Y Or N", enter "Y" or "N" and press 
the enter key. The program expects and will read only the 
first letter. Single letter entrias are contained in 


"repeat...until" loops which will not allow the user to 
procede until one of the offered choices is made. 

Do not use the alternate character (apl) setting on 
the keyboard while using Lochinvar. 

When Lochinvar prompts for the data set name or 
variable names, the user must enter from one to 48 charac- 
ters. These names are one of the faw things for which the 
program does not offer a chance to correct an error. The 
user must be careful, checking the names prior to hitting 
the "enter" kay. If an error is made anyway, the data or 
options file can be corrected using Xadit in CMS. 


24 





3. Minimum Entry 
If the program prompts for an entry, enter at least 
one numeral or character. The interactive read commands are 
u "do while not eoln " (end of lins) loops; if the enter 


key is pressed with nothing to enter, Шели мати на11 halt 
with an error, because it attempted to read beyond the end 
of the line. As with numerical z=ntry errors, previous 


entries will be lost. 
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ТЯ. ВО5ТЕ 


my 
IO 
i'd 


A. CRITERIA FOR SELECTINS THE MOST APPROPRIATE TEST 


In selecting a nonparametric test, the user must 
consider three criteria: purpose of the test, data set level 
of measurement, and whether the data elements are related 


(i.2e., paired or blocked). 


ZB CRTEERTONET: PURPOSE OF TEST 


Nonparametric tests can be successfully used to tes- for 
questions of location, independence/interaction, and distri- 
bution. Table 1 shows a matrix of the nonparametric test 
with level cf measurement (LOM) 92 the horizontal axis and 
general purpose on the-vertical. 

Location usually refers to whether the expected value of 
one population is equal to that of another, based on sample 
variables. Other purpos2s are tasting for proportion of 
successes or for value of a single population's quantile. 
Table 2 lists the location tests with their null hypotheses, 
to help the user narrow tha choice set. 

Independence of two samples refer to statistical inde- 
pendence, 3.6., correlation of zero. Interaction may be a 
before/after effect, or prediction of one variable by 
another. Table 3 lists these tests by their specific null 
hypotheses. 

Some tests for distribution check for equal variance 
among variables. Other tests chac a single  variable's 
empirical cumilative distribution function (ecäf) against a 
theoretical distribution,  wi*h either hypothesized or ssti- 
mated parameters. Still other tests compare the empirical 
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TABLE 1 
Nonparametric Tests 








Goodness of Fit 
(chi-square,N> 14) 
Wilkes-Shapiro Son) 
Lilliefors (N<31) 


2 var Smirnov | 
Cramer-von Mises 


X Interval Ordinal Nominal 
LOCATION: 
1 var Quantile Binomial 
2 var Wilcoxon* Signe — 
Mann-Whitney 
| 22 var Quade* Median Cochran 
Friedman* 
Kruskal . 
-Wallis 
INDEPENDENCE/ 
INTERACTION: 
1 var COX SCUTT McNemar 
| 2 чаг Correlation Cox-Stuart* R*C Contingency 
NP Regression (chi-square)* 
Monotone. 
| Regression 
DISTRIBUTION: 
1 var Kolmogorov (N<31) 


SA O ee O ee en re was tet eet, SE A A LI E en SS —Ҥ = => 


22 var Squared Rank 
Hartley 
* indicates that variables must be paired or blocked 
ie ——————————————————————————— ÁÀÁ 
cdf's of two variables against each other. Table 4 has a 


list of these tests, with their hypotheses, levels of пеаѕ- 
urement, and other considerations. 

The decision matrix indentifies tests suited to each of 
these purposes. The specific section on the test should be 
checked to be Sure that the test will do what is desired. 
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TABLE 2 


| 
| 


Nonparametric Tests for Location 





| 

| 

| | 

| HYPOTHESIS NAME i YARS LOM SEC PAGE | 

P (success) =P * Binomial 1 N ү 30 | 

| P (X<x*) =P* Quantile 1 О VI 35 | 

E (X) =E (Y) Sign 2* O IX 47 | 
Mann-Whitney 29" СО VIE 40 

Wilcoxon 2* I XI 65 | 

| I 

E (W) =.. . =E ( 2) согап 22* N XXIII 108 | 

Medi 22 О XVEL 83 | 

Krüskal- Wallis 22 О XVIII 88 | 

Friedman >2* О XXI 293 | 

Quade >2ж I XXII 108 | 

* indicates that variables must be paired or blocked | 

J 
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TABLE 3 
Nonparametric Tests for Indzpendence/Interaction 


| 

| 

| 

HYPOTHESIS NAME # VARS LOM SEC PAGE | 

| Trend(v. order) Cox-Stuart 1 О VII .43 | 
| Trend (X Y. Y) Cox-Stuart 2* О WET 03 | 
| 

Before/After McNenar 2* N X 51 | 

rho (X,Y) =0 R*C сз пзе су 2ж N XI 56 | 
(chi-square test | 

for indepencence) | 

NP Correlationt 2 О XIV 69 | 

E(YIX) sa+b*X NP Regression 22* І XV 74 | 

b#0 Monotone Regress 22 I XVI 79 | 

* indicates that variables must be paired or blocked | 

# gives pearson's and Spearman's rho,Kendall's Tau | 
к _ _ ИЗИНЕН эе сс а аас аа ааа 
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empirical cd fs 


а 
| 
| TABLE 4 | 
Nonparametric Tests for Location | 
| 
| HYPOTHESIS NAHE # VARS | LOM SEC PAGE | 
| var (W) = darin 52 T XX 96 | 
oe e=Var(Z) Squared Rank >2 I XIX 92 | 

| 
Fx (X) =G* (x) Ropu ogee wi <a) # 1 I XXIV 113 | 
| (specified Gcodn2ss of Fic 1 n RX 61 | 
parameters) (chisquare,N»14) ! 
| Fx (X) =G* (x) Jill i S6ors. (NO31) 1 I XXV 118 | 
(estimated (normal,expon2ntial) | 
| parameters) Goodness of Fit# 1 m XIT 61 ! 
| Wilkes- Shapiro 1 T XXVI 121 | 
| (norma 1,N<31) | 
F x (X) 2»Gx (X) SUMI DOM 2 О XXVII 12H | 
| F,G: Cramer-von Misas 2 О XXVIII 127 


# tests normal,uniform,exponential,weibull,erlang 


ee OX s UL s l 








C CRITERION TI: LEVELS OF MEASUREMENT 


Levels cf Measurement were treated in section II. 


IXCRITERICN III: RELATIONSHIP AMONG DATA ELEMENTS 


Data set variables may be unrelated, or paired (if 2 
И гарез) or blocked (if more than 2 variables). When 
unrelated, the order of variables in the first variable is 
not dependent on the order of of all other variables; e.g., 
the data elements of a variable could be entered in any 
order. In data sets with unrelated variables, the number of 
elements do not have to be equal. 

Paired or blocked data sets reguire sample sizes to be 
equal and the order of the elaments of the variables 
matters. Examples of pairing might be responses of husbands 
and wives, the intelligence of twins, the condition of the 
same experimental subject before and after the experimental 


treatment. 
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ERS SELEETING THE BEST TEST 


AS mentioned in section II, two different nonparametric 
tests applied tothe same data set could conceivably give 
different results to the same hypothesis. The example was 
the sign test and the Wilcoxon sign2d rank test. The reason 
given in section II--that those two tests extract different 
informaticn from the data--underscores the reason for the 
decision matrix. In general, the most appropriate nonparan- 
etric test is one that tests exactly what is desired, and 
uses all the available information contained in the data. 
Using all the available information implies selecting a test 
which requires the level of measurement of the data, and 
using a paired or blocked test if appropriate. 

These rules require thought. For example, the user 
Might want to know if procedure X produces better results 


than procedure Y. Mette stegiancs, that 25 4 problem of 
cation: is E(X)sE(Y)? However, it could also be consid- 
ered a prcblem of distribution: is Рх (2) =Ру (2) ? The test 


for distribution would check the empirical distributions 
across the entire range of observations, and thus use more 
of the informtion of the data set, particularly important 
with small data sets. 

The computer package, described in the previous section, 
makes running multiple tests on the same data set fairly 
easy. The user can explore the results of using different 


tests on the same data, to get insight into his problen. 
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V. BINOMIAL TEST 


A. OVERVIEW 


1. Purpose 
To characterize p, a" populatvconm's proportion от 
"Successes", based on the sample. 
2. Level of Measurement 
Nominal 
3. Assumptions 
The sample is randomly drawn from th2 population. 
Data may be divided into any dicotonous groupings, і. *., 
two mutually exclusive and collectively exhaustive groups. 
Here, the groups are call2d "succsss2s" and "failures". 
4. Hypotheses 
Let p* be a specified numbar 0<p*<=1, 
a. Two tailed test 
(1) Null hypothesis hd: p=p* 
(2) Alternative hypothesis h!: v#o* 
b. One tailed test 
(1) Null hypothesis h92: р<р* 
(2) Alternative hypothesis n1: p>p* 
C.e One tailed test 
(1) Null hypothesis hd: p2p* 
(2) Alternative hypothesis hi: p<p* 


B. ACTICN REQUIRED EY USER OP PROGRAM 


pn s él ab mmo mer nee gp о] Co сш oe пи ЭНӘ оч mp > a ЕЕ = OS ab Se sum sma 


The user specifies which treatment of the data sat 
is used for the test, a hypothesized proportion of successes 


2 


(pk) , and the desired level of significance (alpha*). The 
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program uses the count of successes and failures; there are 
three ways to indicate that dichotomization: 

a. The user enters cell counts. This is the only way 
to handle ncn-numerical data, but will work for any data. 

b. The user dichotomizes the data set by a user 
specified list. This is the way to handle numerical, 
nominal data. Data values which are elements of the speci- 
fied list are in one set; the rest of the data values are in 
the other set. The user indicates whether the list defines 
Success or failure. 

с. The data set is divided in two by a partition 
value. This is only possible if the data is numerical and 
at least ordinal. All elements above the partition valus 
are in one set; the user defines whether that set is success 
or failure. The values below the partition are the other 
type. 

2. Program Activation with Cell Counts (choice a above) 
a. Initiaticn and test selection 
in CMS, type LOCHI. En 
about previously entered data. Eur >" ЕС cel) counts: 


Ber бов пэ to question 


Peer oe "y", ...*2" for desired output file. When offered a 
Choice of tests, enter 1. 
b. Cell count entry 
Lochinvar will ask the number of successes and 
number of failures--enter those cell counts. 
C. Parameter entry 
Lochinvar will ask for a hypothesized proportion 
of successes P*, and a desired level of significance, 
alpha*. When prompted, enter those numbers 
3. Program Activation with Data Values (choice b or c) 
a. Initial Choices 
See section III.C for a detailed explanation of 
the foliowing. 
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In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 1. If data needs *o be entered, 
enter "N" for first question, "y" far second; when data 
entered, .choose to do a test; when offered a choice of 
tests, type 1. Pome Up"... "i for desired output 
file. 

b. Variable and Parameter Choice 
When prompted by Lochinvar: 
(1) Enter index of variable to be used in test. 
Enter whether dichotcmizing by list or by partition value 
(2) If by list, enter the number of elements in 
sat list. Enter elements of list. пег "уч; when 
prompted if elements on that list represent success. 
(3) If by partition value, enter that value. 
Enter "Y" if data elements below that value are successes. 
4. Test Executicn 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available to the Ussr 
The program CRUNCH displays the data file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and the variable used in the test are at the top of 
the output file. The empirical level of significance, 
alpha-hat, and confidence interval limits, are listed. 


C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


a. Count number of successes. Dow ge by “She total 
number of elements to give p-hat. 
b. If N<20, generate a binomial probability mass 


function with parameters N and p-hat. Otherwise, use the 





normal approximation to the binomial, with parameters 
N*p-hat and N*p-hat*(1 - p-hat). 
c. Compute tail probabilities 

(1) The upper tail probability is the likelihood 
that the number of successes is greater than or equal to the 
sanple number of successes. 

(2) The lower tail probabilty is the likelihood ` 
that the number of successes is lass than or equal to che 
sample number of successes. 

d. Conpute the level of signficance, alpha-hat 

(1) if h0: pSp*, alpha-hat equals the upper tail 
probability. 

(2) if h0: p2p*, alpha-hat equals the lower tail 
probabiltiy 

(3) if h0: p=p*, alpha-hat equals two times the 
minimum of the upper and lower tail probabilities 

e. Compute the interval, N less than 21 
(1) Let y1 = number of successes minus one, = 
index of the pmf of the lower tail probabilty. Vary p*, 
O<p*<=p-hat, until the lower tail probability equals 
alpha*/2. P* is the lower confidence limit. 

(2) Let y2 = number of successes plus one, 
index of the pmf of the upper tail probability. Vary p*, 
p-hat<p*<=1, until the upper tail probability equals 
alpha*/2. This p* is the upper confidence limit. 

f. Compute the interval, N greater than 20 
(1) Compute the standard error of estimate by 
taking the square root of the quantity (number of successes 
minus the number of failures, divided by N to the third). 
(2) Find the value z, which is the desired quan- 
tile, 1l-alpha*/2, of the standard normal distribution 

(3) The lower confidence limit is p-hat minus z 

tines the standard error. The upper confidence limit is 


p-hat plus z times the standard error. 
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2. Program Operations 
a. Determine the cell count 
b. If N is less *han or equal to 20 
(1) Generate a vector of N choose k, k=0,1,...,N. 
(2) Generate a binomial pmf, with parameters N 
and p-hat=number of successes divided by N. 
(3) Compute the tail probabilities 
(à Compute thes empirical level of significance 
(5) Using the previously generated N choose k 
vector, generate partial binomial pmf's, with terms to 
number of successes minus one, or number of successes plus 
one to N. A bisection search technique varies p, 0<p<=p-hat 
or p-hat<=p<=1, until the sum of the terms of the partial 
pmf is within 0.0001 of the desired level of significance 
(alpha*/2, because the interval is two tailed). 
C. If the sample contains mors than twenty elements 
Use the normal approximation to the  binonial. 
The subrcutine Muvar uses numerical integration to find tail 
probabilities; from which the empirical level of signifi- 
cance is estimated. Muvar also funishes the z quantile for 
computing the confidence interval limits. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400, if the 
program ccmputes the cell count from a data list; unlimited 
if the user enters the cell counts directly. Maximum number 
of variables: 1. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Binome,Which vec,Hyp 
(2) Crunch: Bintest; Muvar,  Cibi21 or Nchuzk, 
Benponf, Cibi20, Bisex 
(3) External: Shsort 
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А. OVERVIEW 


1. General Purpose 


U U 


E 


The quantile test addresses the question of what 
proportion of the population is less than or equal to a 
specified value. An example would be that a student might 
be concerned less with the average grade on a test than the 
proportion of students who receivald at least thea minimum 
passing grade. 

2. Level of Measurement 

Ordinal 


3. Assumption 





The data set is a random sanple from the population. 
4. Hypotheses 
Here, p* is the specifiei population proportion, 
OSp*<=1, and x* is a specified valua within the range of the 
population. 
a. Two tailed test 
(1) Null hypothesis hd: 
the p*th quantile = x* 
(2) Alternative hypothasis hi: 
the p*tth quantile # x* 
b. Ona tailed test 
(1) Null hypothesis hd: 
the p#*th quantile > x* 
(2) Alternative hypothasis h1: 
the p*th quantile < x* 
C. On2 tailed test 
(1) Null hypothesis h): 
th 


the p* quantile < x* 
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(2) Alternative hypothesis h1: 
the p*th quantile > x* 


B. ACTION REQUIRED BY USER OF PROGRAM 


1. User Decisions about Program Options 
The user must specify the values for p* and x*, and 
the letter of the hypethesis desired. 
If there is more than one variable in the designated 
data set, the user must specify which is to be used. 
2. Program Activation when Using Data Values 
a. Initial Choices 

See secticn III.c for a detailed explanation of 
the following. 

In CMS, type LOCHI. If lata is already in a data 
file, enter "Y" to first question, then enter file name; 
when cffered a test, type 2. If data needs to be entered, 
enter "N" for first question, UE ES seconds when data 
entered, choose to do А test; when offered a choice of 
Bestes, type 2. Enter у 2... "2" for desired output 
noe. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter index of variables to be used in test, 
if there is more than one in the data set. 

(2) Enter the values of ре, х*, the letter of 
the chosen hypothesis, and the hypothesized level of signif- 
icance, alpha*. 

3. Test Execution 
Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Output Available to the User 
The program CRUNCH displays the data file to which 
the data has been sent, e.g., "U LISTING". The names of the 
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data set and variables used are at the top of the output 
ale. Also included are the test statistic (the number of 
data elements less than or equal to x*), the level of 
significance, and confidence limits for the population's 


p*th quantile. 


C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


- Calculation of Level of Significance and Confidence 


a. Compute the Test Statistic 


Crunch cempare the data elements wi-h x*, 
counting the number which are less than or equal to x*. DE 
there are no ties, Т1 = T2 = the count. If some elenents 
are equal to x*, then T1 = the count of data elements less 


than x*, and T2 = T1 plus the number of ties. 
b. Conputation of the Level of Significance 
If T1 = T2, the computation of the level of 
Significance is exactly the same as th2 binomial test, with 
T1 as the number of successes, the total number of data 
elements as n, and p* as the p* of the binomial test. T£ 
T1 < T2, then the lower tail probability is calculated using 
Т1; the upper tail probability uses T2; and the computation 
procedes as with a normal binomial test. Sea section V for 
details about computation of level of significance in the 
binomial test. 
Gamerand@the Confidence Limits 
The data elements are ranked; the ranked values 
are called "order statistics", so that X(1) is the smallest 
value, X(k) is the largest values in a data set of k values. 
Ihe confidence limits are found by computing the values г 
and s, which are indices of the order statistics; once they 
are found,  X(r) is the lower limit, X(s) is the larger 
sma c. 
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( 1) With less than 21 data elements, a binomial 
pmf is generated, with parameters k = number of data 
elements, and p= p*. R is found by summing terms of the 
pmf from the Oth term (P(N=0)) until the first term which 
causes the sum to be greater than the half of the hypothezed 
level of significance (alpha*/2). The index of that term is 
г. Sis found by summing terms of th2 pmf from the kth tern 
(P(N=k)) until the first term which causes the sum *о be 
greater than alpha*/2. The index of that term is S. 

This procedure is conservative on both bounds, and 
therefore may be very conservative overall. For example, 
with k = 18, p* = 0.5, and alpha*- 2.15, Crunch would choose 
or Sand s Sf 13; the probability that in the population 
an x would fall between X(5) and X(13) is 0.904, while the 
user would have been satisfied with about 0.85. With the 
binomial tablas, the user might choose indices 6 and 13, or 
ana. 13, which would have only 0.0481 in one tail but 
2551151 in the other, so that the test would have almost 
exactly the alpha desired. | 

(2) With more than 20 lata elements (=k), the 
hypothesized variance is k(p*(1-p*)); its square roct is the 
sample standard deviation. The expected value is kp*. R is 
the expected value plus the quantity standard deviation 
times the (alpha*/2) quantile of the standard normal distri- 
bution. Ms Not an integer, round up. S is the 
expected value plus the quantity standard devation times the 
(1-alpha*/2) quantile of the standard normal distribution. 
Again, round up if necessary to make s an integer. 

2. Program Method of Operation 
The subroutine celler calculates Ti and T2 by 
calling the subroutine dicot, which takes the values of the 
data set and find the number less than and the number less 
than or equal to the specified x*. The subroutine quantile 
receives that information, then calls the subroutine bintst 
to perform the binomial test described above. 
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Quantl then calls subroutines Nchuzk and Binpmf to 
generate a binomial pmf to calculate the confidence limit 
indices r ands, if the total number of values is 20 or 
less. Otherwise, Quantl calls the subroutine Normv for 
values of the (alpha*/2) and (1-alpha*/2) quantile of the 
normal distribution to use in the calculation of r and s. 
The data values are sorted by the subroutine RNQ, and the 
X(r) and X(s) values dereferenced as the confidence limits. 


3. Program Characteristics 





a. Limitations 
Maximum number of data points: 400. Only one 
variable is used. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Quantile 
ЕЛСИ Опа Е, Cellar, Dicot, Bintst; Rxng, 
Nchuzk, Binpmf or Nornv 
(3 External: Shsort 
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With two variables, test whether tha populations 
from which *hs variables were drawn have the same mean. 
2. Level of Measurement 
Ordinal 
3. Assumptions 


a. Each variable is a random sample. 


o 


. Thare is mutual independence between samples. 
4. Hypotheses 
a. Two tailed test 
(1) Nul3 hypothesis h0: E(X) =E(Y) 
(2) Alternative hypothesis h!: E(X)*sE(Y) 
b. On2 tailed test 
(1) Null hypothesis hd: E(X)2E(Y) 
(2) Alternative hypothasis hi: E(X)<E(Y) 
C. Ona tailed t2st 
(1) Null hypothesis hd: E(X)SE(Y) 
(2) Alternative hypothasis his: Е (Хх) >E(Y) 


B. ACTICN REQUIRED EY USER OP PROGRAM 


1. User Decisions about Proarın Options 
The user must decide which two of tha variables to 
WO. The 


ct 


usa in the test if the data set contains more than + 
user specifies the level of significance. 
ELO GB Sct iverson when Using Data Valuss 


a. Initial Choices 


Sesecect ten Lil. € fOr "“nitial choice details. 
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ПП шүре LOCH. “fi data is already in a data 
Pele, enter to first guestion, then enter file name; 
when cffered a test, type 4. If data needs to be entered, 
encer "N" for first question, "y" for second: when data 
entered, choose to do а test; whan offered a choice of 
tests, type 4. се IA. for desired out put 
file. 
b. Variable and Parameter Choice 
When prompted by Lochinvar: 
(1) Enter indices of variables to be used in 
test. 
(2) Enter the level of significance. 
3. Test Executicn 
Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Output Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
mile. The empirical level of significance and whether to 


reject the null hypothesis are stated. 


C. ACTICNS PERFORMED BY THE PROGRAM CRUNCH 


alculation of Test Statistic/Level 


a. Sums of the Ranks of tha First Variable 
Rank the values of the two variables. Sun the 
ranks of each. The test statistic T is the sum of the ranks 
of the first variable. 
b. Overall Test Statistic 
The overall test statistic, if their are many 
pos, 15 Т1. Let n be the number of elements in the first 


variable, m the number of elements in the second variable; N 
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is the sum of n and п. Sum the squared ranks of both vari- 


ables to find R2. The numerator of the test statistic is T 
minus n(N+1) 2. The denominator is the square root of ths 
difference of two terms: R2*nm/N(N-1), and 


(nm (N*1) **2) /4 (N-1). 
2. Program Method of Operation 
The subroutine Manwht calls the nonimsl subroutine 
Utest, to get the level of significance. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maximum 
number of variables: 2. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Manwhit 
(2) Crunch: Manwht 
(3) External: Ut est 
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А. OVERVIEW 


1. General Purpose 
This is a specialized fora of tke sign test, to 


determine if the data elaments in the second half of a 
sample are significantly greater or less than “the corre- 
sponding valuss in the first half. With ona variable, th 
order of the lata elements determinas how the data elements 
are paired for comparison. 

With two variables, in which the data elements ara 
paired, the ranks of the ¡independent variable orders the 
values of the dependent. This ylelds a rough test of +h 
interaction or dependence of the two variables. 

2. Level of Measurement 

Ordinal 

3. Assumptions 

a. The data elements represent random draws 

b. Either there is no trend, or there is an under- 
lying trend upward or downward throughout data. 

4. Hypotheses 
Where (+) indicates the second data element is 
greater than the first in the pair, otherwise (-), so that 
P(+)=P(-) indicates no trend, P(+)>P(-) indicates upward 
trend (1 variable) ог vary directly (1st variable *о 24 
variable), and P(+)<P(-) indicates iownward trend or vary an 
inverse relationship. 
а. Two tailed test 
(1) Null hypothesis h0: P(*)zP(-) 
(2) Alternative hypothesis h1: 2 (+) #P(-) 

b. One tailed test 

(1) Null hypothesis h0: P(*)SP(-) 
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(2) Alternative hypothesis h1: P(+)>P(-) 
c. One tailed test 

pet ness HO: Pp (+)2P (=) 

(Cia e ena tive hypothasis h1: P (+) <P(-) 


БОО ACTICN REQUIRED PY USER OF PROGRAM 


1. User Deci 
If one variable is used, make sure data elements are 
in the order (often order of tina) for ehe trend. .+o be 
tested. If two variables are to be us2d, data elements must 
be paired. The user selects the type of hypothesis and the 
level of significance. If there are two variables, the user 
specifies which is dependent. 
2. Program Activation when Using Data Values 
a.e Initial Choices 
See section III.C for a detailed explanation of 
the following. 


In CMS, type LOCHI. If data is already in a data 


Mee, enter "Y" to first question, then enter file name; 
when cffered a test, type 3. If data needs to be entered, 
Enter "N" for first question, "n CODE Second: when data 


entered, choose to do а test; when offered a choice of 
pests, type 3. Enmcocme WPLIM LI Iznror* destred output 
ile. 
b. Variable and Parameter Choice 
When prompted by Lochinvar: 
(1) Enter index or indices of variables to be 
used in test. EMS paros to be =xcluded, if using iwo 


variables. 


23 


(2) Enter letter of hypothesis tyve and level of 
= nificance. 
(2) If there are two variables, enter which is 


dependent. 
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To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data ser and variables used are at the top of the output 


file. 


С. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation of Level of Significance 
а. fOrne pares (X,¥) Significance 
(1) If there is one variable, call it Y, and go 
to step (2). If two variables, let X be the independent 
variable. and Y be dependent. Find the order of the X. Use 
that ordering to re-order Y in ascending order of the values 
or X. 

(2) Let k be the number of elements in the vari- 
able Y. If K is even then there will be (k/2) pairs, if odd 
then ((k-1)/2) pairs. Let np equal the number of pairs. 

Besrand Test Statistic 
(MO: 1=1 to np, conpare Y (1) to Y(np-i+r1). If 
Y(1) > Y (np-1+1) then increment the counter for failures. If 
Y(i) < Y (np-i+1) then increment the counter for successes. 
The test statistic is from the binomial test, with n equal 
the number of successes plus failures, p* equal 0.5, and the 
number of success equal number of times the second half data 
elements are strictly greater than the corresponding first 
half elements. 
2. Program Method of Operation 
a. Ordering the Dependent Variable 
The subroutine Coxstu dereferences the X values 


into the INDEP vectoz, sorts *hoss values using Shsort, and 
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uses the indices returned in the vector KEY to dereference 
the Y values into a vector TLIST. 
If there is only one variable, use a do for loop 
to load its values, in their current order, into TLIST. 
b. Computing the Test Statistic 
The subroutine Coxstu calls the subroutine Signt 
to perform the sign test. See section IX. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maximum 
number of variables: 2. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Coxstu 
(2) Crunch: Gegstu, Signt 
(3) External: Shsort 
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А. OVERVIEN 
l. General Purpose 
With pairs of data  (Xi,Yi), determine if the 
expected value of the first element, E(X), is greater than 
the expected value of the second elaneant, E(Y). 
2. Level of Measurement 


mined that 


than Xi (-), эг that they are equal. 
do not have to be ordered with any values 


E 


de 


b. 


n 
erdinsIzuithen each 


passe that is; ift can be deter- 


(+), or that Yi is greater 


The 


Xi is greater than Yi 
values of +h2 pair 


outside the pair. 


Assumptions 


Random Sample 

X,Y are a random sample 5f the population pairs. 
Level of Measurement 
Ordinal within sach pair, nominal among pairs. 
аашаа сей с= Probability that Xi > Yi 

the sample 


Ins sunderlyiag population feom which 


is drawn is assumed to have a constant P(X>Y). 


4. Hypothesis 
Here, (+) is the case that X>Y, (-) is the case that 
Y^X, where X,Y refer to population random variables. 


а. 


Ce 


Two tailed test 

(1) Null hypothesis h0: Р(+) =Р (-) 

(2) Alternative hypothasis h1: P (+) #P(-) 
One tailed test 

(1) Null hypothesis ҺО: 

(2) 
On2 tailed test 

(1) Null hypothesis h0: 

(2) 


PIE) 
Alternative hypothesis h1: р (+) >Р(-) 


P(*)2P(-) 


) 
Alternative hypothesis h!: P(*)€P(-) 
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В. ACTICN REQUIRED BY USER OF PROGRAM 


e data is non-numerical, but 
апа! within pirs, e.g., a list of winners of arm- 
wrestling contest between two platoons as a measure of arm 
strength. If so, then perform the test as a binomial test, 
entering cell counts, with number of successes as the number 
of times Xi >Yi, n is the total number of untied pairs, and 
p* as 0.5. 

With numerical data, the program will make the 
comparisons. The user must enter the hypothesis letter anrd 
the hypothesized level of significance. 

If there are more than two variables in the desig- 
nated data set, the user must choose which are included in 
the test. 

2. Program Activation when Using Data Values 

a Initial Chceices 

See secticn III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If lata is already in a data 
file, enter "Y" to first question, then enter file- name; 
when offered a test, type 5. If data needs to be entered, 
enter “N” for first question, "Vr or second. when data 
entered, choose to do а test; when offered a choice of 
tests, type 5. En er ио lu" tor desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of variables to be used in 
test. Enter blocks or pairs to be excluded, if any. 

(2) Enter the letter of the hypothesis type and 


the hypothesized level of Significance, alpha*. 
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3. Test Execution 
Tc perform the test, after the above entry of da-a 
and parameters, type CRUNCH while in CMS. 
4. Output Available to the User 
The program CRUNCH displays th2 output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. The test statistic and the level of significance are 


printed, as well as whether to rejsct the null hypothesis. 


C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation оё Level of Significance 
a. Calculation of the Test Statistic 
For each pair (Xi,Yi), Subtract Yi from Xi. 
Count the number of times the difference is greater than 0: 
this is the test statistic. The total number of untied 
pairs is the count of times the difference is not zero. 
b. Calculaticn of the Level of Significance 
The level of significance is calculated by the 
binomial test, With the number of successes equal to the 
test statistic, the total sample size equal to the number of 
untieá pairs, and the hypothesized proportion of success p* 
equal to 0.5. 
See secticn V for details about computation of 
levels of significance by the binomial test. 
2. Program Methcd of Operation 
The subroutine Signt compares the values of the 
lower indexed variable with the corresponding values of the 
other variable. Values within epsilon (=0.0091 by default) 
are considered tied. If not tied, then the program checks 
Busther Ki>Yi or Xi<Yi, and increments the appropriate 
count, (+) or (-). Sign: then Calls the subroutine S3intst 


to compute che level of significance. 
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3. 


{о 


rogram Characteristics 

а. Limitations 

Maximum number of data points: 400. Maximum 

number of pairs: 200. Required number of variables: 2. 

b. Subroutines/Procedures Used 
(M LOGhinvar: Signtest 
(Zeecrimen: Sgnt 
(3) External: Shsort 


o 





SS NENE AE ST FOR SIGNTEISANCESOE CHANGE 


A. OVERVIEW 





With data in pairs (Xi,Yi), and x and y values sach 
classifiable as successes or failures, to determine if a 
significant change has occurred. 

2. Level of Measurement 





Nominal 
3. Assumptions 
a. Th? pairs are a random sanple. 
b. X and Y can be divided ints mutually exclusive 
and collectively exhaustive groups: success and failure. 
C. Each of the k pairs cana be categorized inte one 
of four cells: success/success (1,1), success/failure (1,0), 
failure/success (Qe), and ftarluze/tanlturo (0,0). 
4. Hypothesis 
a. Null hypothesis 
90: Р (0,1) =Р(1,0) for all i 
b. Alternative hypothesis 
Н1: Р (0,1) Рр (1,0) for all i 


B. ACTICN REQUIRED EY USER OF PROGRAM 


If *h» data is nominal, the user must enter the 
counts of each of the four categories. 

If the data ís ordinal or above, the user may enter 
cell counts, or may specify partition values to dichotomize 
X and Y. КОТИ ОЛУ аг will ask for ¿ne partition value for X, 


and whether sıccesses are less than it. For example, if 
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I S [pe particion value for х, ала successes are greater 
than it, any X value greater than 17.5 will be a success; 
depending on its Y value, the pair will be counted in the 
success/success (1,1) cell, or success/failure (1,0) cell. 
After asking for the X partition value, Lochinvar will ask 
for the Y partition value and whether successes lie above or 
below. 

Tf there are more than two variables in the desig- 
nated data set, Lochinvar will ask which two are to be used 
for the test. 

2. Program Activation when Using Cell Counts 
a. Initiation and test selection 
СИС Ере ОСН Едер "NW for го to question 
about previously entered data. Enter” elfo. cell counts, 
Enter "U","Y",..."Z" for desired output file. When offered a 
choice of tests, enter 6. 
Er, Cell count entry 
Lochinvar will ask the cell counts for each of 
the four categories: Success/success, success/failure, 
failure/success, and failure/failure. 
Enter the hypothesized level of significance. 
3. Program Activation when Using Data Values 
a. Initial Choices 
See section III.C for a detailed explanation of 
the follcwing. 
In CMS, type LOCHI. If data is already in a data 
ENGL enter "Y" to first question, then enter file name; 
when offered a test, type 6. If data needs to be entered, 
enter "N" for first question, "yn for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 6. Ente UN Vie u er or desired output 
Eile. 
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b. Variable and Parameter Choice 
When prompted by Lochinvar: 
(1) Enter indices of the two variables +o be 
used in test. Enter blocks or pairs to be excluded, if any. 
(2) Enter the partition value for the lower 
indexed variable and whether successes are less than that 
value. Enter the partition value for the variable with the 
higher index; indicate whether successes are less than that 
value. 
(3) Enter the hypothesized level of significance. 
4. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
Erle. 

If there are 20 or less pairs in the data set which 
are classified as (1,0) or (0,1), then the test statistic is 
T2=number of pairs classified as (1,0); that test statistic 
and level of siginificance will be listed. Otherwise, the 
test statistic T1, explained below, and the level of signif- 
icance will be given. 

In either case, the program will compare the empir- 
ical level cf significance with the hypothesised, and state 
whether the null hypothesis is rejected. 


С. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


IE ecocenpe On Cl Calculation of Level of Siguiticance 


a. Computation of Cell Counts 
If cell counts were provided by the user, go to 


step b or c. If data values were used, Crunch examines each 
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pair (Xi,Yi) to see the relation sach data element has to 
its corresponding partition value. Crunch decides which 
s the pair belongs to, (1,1),...(0,0) , and increments the 
count for that cell. 
b. If Count for Cells (1,0) and (0,1) less than 21 

The test statistic is T2, which equals the count 
of cell (1,0). The binomial test is invoked to find the 
level of significance. T2 is the number of successes; the 
total count of cells (1,0) and (0,1) as n, the total sample 
size; che hypothesized proportion of success, р* 15 0.5. 
See section V for details of computation. 

c. If Count for Cells (1,0) and (0, 1) exceeds 20 
The test statistic is Tl, equal to the squared. 
difference between the cell counts (1,0) and (0,1), divided 
by their sum. This test statistic is compared to the chi- 
square distribution with one degree of freedom to find the 
level of significance. 
2. Program Method of Operation 
a. Cell Count Calculation 

The subroutine Celler compares each of the lower 
indexed values to the first partition value, then each of 
the other variable's values to the second partition value. 
At each comparison, a "1" is assigned to an integer vector 
if the value is a success, otherwise a "0" is assigned. The 
resulting integer vector, with the same number of elements 
as the original data set, is then used to find cell counts. 
With k pairs, the first through the kth elements of the 
integer vector are multiplied by two, then added to the 
corresponding k+1st through 2kth elements. The resulting k 
element vector has values 0,1,2,3, corresponding to the four 
cells. The vector is then used to compute cell counts. 

Desealeulaticn of the Test Statistic 

erumenstigures if the z211 counts 5f (1,0) and 

(WA) total more than 20. If so, T1 is computed, otherwise 


T2 is used. 
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Gawcad ciation Of the Level of Significance 
If T1 is used, the subroutine mcnemar performs 
linear interpolation of tabled values of the chi-square 
distribution with one degree of freedom. If T2 is used, the 
binomial distribution subroutine Bintst is used +o calculate 
the level of significance. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maxinum 
number of pairs: 200 if program computes cell counts; no 
limit if the user enters cell counts. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Cel_test, McNenar 
(2 Crunch: AcNanmnr, Cellsr, McCount 
(3) External: Shsort 
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A. OVERVIEW 


1. General Purpo 


lo 


Used with a data set composed of vaired data 
elements, to test whether the classification or value of “he 
first element of the pair 15 statistically independent of 
the classification or value of the second. 

2. Level of Measurement 
Nominal 
3. Assumptions 
а. The pairs is randomly drawn from the population. 
b. The first element of each pair can be placed in 
exactly one of r categories (rows), and the second elemen- 
of each pair can be placed ¿in exactly one of c categories 
(columns). 
4. Hypothesis 
a. Null hypothesis 
HO:  P(paiíir is in row i and col j) equals P(first 
element in row i) times P(second element in row j) 
b. Alternative hypothesis 
ENE Das is In Tow I and col, 4) doses. not equal 


P(first element in rcw i) times P(s=2zond element in row j) 


Bee ACTION REQUIRED BY USER OF PROGRAMS 


1. User Decisions abzut Prograa Qotions 
q rn ry O£ cell counts 
If the data is nominal, the user must decide the 
Categories of the rows and columns, and enter the observed 


Gell counts, ab a for =3ch cell. The user may use this 


9 
C Cinique for data of a aigher level of measurement. 
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POPE LE O De d How. and column totals 
Tf the data is ordinal or above, and the categor- 
Ucatvon of the data points is by their order (e.g., not by 
whether their last digit is odd), the user may enter Ri's 
and Cj's. The sums of the Ri's and Cj's must exactly equal 
the number of pairs in the data set. 
C. Entry of row and column partition values 
If the data is ordinal or above, and the categor- 
ization is based on order, the user can specify (r-1) values 
to divide the data elements into r rows, and (c-1) values to 
divide the data elements into c columns. For example, 2 
data pair would fall into cell (1,2) if the first element of 
the pair were less than or equal to the first row partition 
value, and the second element of th2 pair were greater than 
the first column partiton value but less than or equal to 
the second. 
2. Program Activation when Using Cell Counts (choice a) 
a. Initiaticn and test selection 
In CMS, type LOCHI. Enter "N" for no to question 
about previously entered data. ЕК ор "снос Celco UNITS: 
Aer "UN MY, ... MZ" for desired output file. When offered a 


choice of tests, enter 7. 


/ 
р Ceri count entry 
Lochinvar will ask the number of rows and 
columns. It will then prompt entry of the observed cell 


esunt for each cell. 
J. Program Activation Using Data Values (choice b or c) 
a. Initial Chcices 
See secticn III.C for a detailed explanation of 
the following. 
In CMS, type LOCHI. I2 lata is already in a data 
file, enter "Y" to first question, then enter file name; 
when cffered a test, type 7. If data needs to be entered, 


Eur "NX" for first question, "ү ror seconds enter the 
q 
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data; Henao tened a Choice Sof tests, Еуре 76 Enter 
Uu" ror gessxred output file. 
b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of the two variables to be 
used in test. Enter the number of rows and the number of 
columns. Enter whether using fixed row and column totals or 
partition values. 

(2) If by fixed totals, enter the row totals 
when prompted, then the column totals. 

(3) If by partition value, enter the row parti- 
tion values when pronpted, then the column partition values. 

4. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Output Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The data set and 
variable names are at the top of the output file. The test 
statistic T, the degrees of fresddn, and the matrix of 
observed cell counts are listed. 


C. ACTICNS PERFORMED BY THE PROGRAA CRUNCH 


1. Description of Computation of Iest Statistic 
The program Crunch will: 
a. Find the observed cell count 
Specify the r rows and c columns, thus creating 
r*c mutually exclusive cells. Count the number of pairs 
which fall into the categorization 5f each cell: that number 
is the observed cell count, Oij, of cell ij. 
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b. Find the expected cell count 
All the cell counts in raw i are summed to find 
the row total Ri. All the cell counts in column j are 
summed to find the cclumn total Ci. The expected cell count 
pis found by multiplying Ri times Cj and dividing by the 
total number of pairs, N. 
c: Find ene test statistic T 


коса пн се 50р Егасе 015 from  Eij. Square 
each difference. Divide each squared difference by the 
corresponding Eij. Sum those quotients to find the test 


statistic T, which can be compared to a chi-square distribu- 
tion with (r-1)*(c-1) degrees of freedom. The quantile of 
the chi-square distribution corresponding to T indicates the 
empirical confidence coefficient, or the complement of the 
empirical level of significance. 
2. Progran Operations 
a. Computation of the cell counts 
Ire ise sSUppireda the cell. counts, go to step 
b. If the user gave fixed row and column totals, rank all 
the first elements to find the raw partition values, and 
rank all the second elements to find the column partition 
values. Ccmpare the first element of each pair to the row 
partitions: assign to each pair the index of the largest 
petition value which the first eianent is less than; if the 
first element is larger than the largest partition value, it 
is assigned the integer r. Do the same with the second 
elements and the column partition values. Each pair now has 
two integers assigned, which designate membership in a row 
and in a column. Count the number of pairs falling in each 
cell: this is the observed cell count Oij. 
D. Compute the test statistic T 
Find the row aná column totals by summing Oij's. 
Compute the expected cell counts 51] by Ri*Cj/N. Square the 


e erence (O01j-EFij), divide by Eij, sum over all cells. 
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a. Limitations 
Maximum number of data points: MOO, Le othe 
program computes the cell counts; unlimited if the user 


supplies cell counts. Maximun number of variabiss/ 


treatments: 2. Maximum total cells (r*c): 50. 


b. Subroutines/Procedures Used 
(1) Lochinvar: Cel test,Rc cort; ог Which, test, 


Pemcont,sct cell ,Which vec, Exclud_b 
er Crunch: Celler, ,Partv,Tc3 or Tc4, Rectr,Rcont 


(SIP еса Shsort 
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1. General Purpose 
With a sample of 15 or more data elements,  *est if 
that sample could be drawn from a specified distribution. 


11 
2. Level o 


Where F* (x) is hypothesissi distribution function, 
апі Р (х) is the distribution of th2 population. 
a. Null hypothesis 
НО: F(x)=F*(x) тот ат 
b. Alternative hypothesis 
H1: F(x)*F*(x) £or a* laast one x 


B. |ACTICN REQUIRED EY USER OF PROGRAM 


about Progran Options 
decide among five distributions for 
which to test. Parameters may be specified by the user, or 
may oe estimated by the program (using method of moments). 
2. Program Activation when Using Data Values 
a. Initial Choices 

see ¡section III.C for initial choice details. 

JC MS UR DO LOCHI, —If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 8. If data nezds +9 ba entared, 
enter "N" for first question, EVN WEOs second: enter das*a, 


Ore 





and when offered a choice of tests, type 8. Enter 
aye OYN, e.s. "Z" for desired output file. 
b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of variables tə be used in 
test. Enter blocks or pairs to be excluded, if any. 

. (2) Enter thaoretrcal drstribution . Choose 
among: uniforn, normal, exponential, erlang and wiebull. 


(3) Enter the parameters of the theoretical 
AS tribution, or request that tha program estimate the 
parameters. In the normal and exponential the method of 


moments can be invoked to allow tha program to estimate the 
maximum likelihood estimates of the parameters. In the case 
of the erlang, the user must specify the integer shape 
parameter n, but can allow the program to estimate the scale 
parameter lambda. In the case of the uniform and the 
weibull, the user must specify the parameters 
3. Zest Executicn 
| TON PERO nene test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Output Available to the User 

The program CRUNCH displays the output file to which 
the data has been sent, €.J., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. The test statistic T and the degrees of freedom will 
be displayed. Finally, the matrix of cell counts, expected 
and observed, will be given. 


C. ACTIONS PERPORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation of Test Statistic 
a. Calculation of the Expected Cell Counts 
Let k equal the number of data elements. EAE 





15<sk<=50, divide k by 5 and round to the nearest integer. 
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ill Бе С, the number of cells; 2.9., k=17 implies C=3, 
k=18 implies Z=4. In all celis except perhaps the first and 
last the expected cell count will be 5. If C*5=k, then the 
expected cell counts for all cells is 5. If C*5zk*1, һе 
Ex ccted Cell count of the first call is 4; if C*5=k+2, the 
expected cell count of the first and last cells is 4. ШЕ 
C*5zk-1, the expected cell count of the first cell is 6; if 
C*5zk-2, the expected cell count of the first and last cells 
is 6. For example, if k=17, C=3; since 3*5=17-2, the first 
and last cells have expected counts of 6, i.e., the expected 
cell counts are 6,5,6. 

If k>50, there will be 10 calls. Divide k by 10 
and truncate; that will be the minimum expected cell count 
E. The remainder of the truncation is rem=k-10*E; add 1 to 
the cell count of the last rem cells. 

The expected cell counts are stored in the 
vector EJ. This is the building block of the F=P (Xx) 
values of the theoretical distribution. Scan the EJ vector 
to find the CUM vector, ес, CUM (1) =EJ(1), 
BUML2)=SEI(IYHEI (2), ..., CUM(j)=sum ОЁ EJ(i), 1=1,..,ј. 
Divide each element of the CUM vector by the number of 
elements, k, to form the CUMP vector. 

b. The Partition Values 

The usual form of theoratical distributions takes 
a value x and returns a value F, such that F = P(X<x). In 
this case, the inverse form of the theoretical distributions 
is used to take an F, contained in the CUMP vector, and 
return an x. The first (С-1) values returned form the 
partition values which will divide those data elements into 
ls. 

C. The Observed Cell Count 

The data elements are compared to the partition 
values contained in the vector  PARIV. An element falls in 
the ith cell if it is larger than PARTY (i-1) but less than 
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or equal to PARTV(1). Elements less than PARTV(1) arz in 
the first cell; elements larger than PARTV (C-1) are in the 
last cell. The observed cell counts are contained in the 
vector RCCT. 
d. The Test Statistic 
For each cell, Suptfract  Bg(2) from  RCCI((2)., 
Square that difference, and divide that square by EJ(i). 
Sum over all cells. This 45 the test statistic T, with 
degrees of freedom (C-1). 
2. Program “Method of Operation 
The subroutine Gdfit calls tha subroutine  Muvar to 
find the sample mean and variance if parameters must be 
estimated. The subroutine Cpvgen figure the EJ, CUM, and 
CUMP vectors. The vector PARTV is figured in one of five 
subroutines, depending on the thecratical distribution spec- 
ified: unid, nornd, expd, erld, weid. The observed cell 
Esununts RCCT are found in Celcnt. tae test statistic T is 
computed in the subroutine Gdfit. 
3. Program Characteristics 
a. Linitations 
Maximum number of data points: 400. Maximum 
number of variable: 1. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Comro ce a Gi fit e, Gd fit<u, 
Et er, Gd fit w, Gd fit 
(Can che —Gourit, Covgen, Muvar, Celcnt, Unid, 
md, Expd, Erld, Weid 
(3) External: Shsort 
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A. OVERVIEW 


With two paired variables (X,Y), the signed rank 
test determines whether the expect2d values of populations X 


1. General Purpose 


and 7 could be the same. 


rh 


This requires interval data. I£ the data is only 


ordinal, use the sign test, 
2. Level of Measurement 
Interval 
3. Assumptions 
Thespaırs  (X:,7Y:2) are random samples from the popu-~ 
lation of all pairs. 
4. Hypotheses 
a. Two tailed test 
(1) Null hypothesis h0: E(X)=E(Y) 
(2) Alternative hypothasis h1: E(X)#E(Y) 
b. One? tailed test 
(1) Null hypothesis h2:E(X) SE(Y) 
(2) Alternative hypothasis hi: E(X)>E(Y) 
c. One tailed test 
(1) Null hypothesis hd: E(X)>E(Y) 
(2) Alternative hypothesis h!: E(X)<E (Y) 


B. ACTICN REQUIRED EY USER OF PROGRAM 


I. User Decisions about P 
neve Ole oa a Constant term can 
a so that the test 


be added to the higher indexed vari ; 
Sor (sand FIS) +b. 


becomes a question of the relationshi 
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By default, the value of b is 0; but the user can specify 
any real number. 
Lochinvar also asks for the hypothesis type and 
level of significance. 
2. Program Activation when Using Data Values 
a. Initial Chcices 

See secticn III.C for a datailed explanation of 
Ee following. 

In CMS. type LOCHI. Tf data is already ina data 
mere, enter "Y" to first question, then enter file папе; 
when cffered a test, type 9. If data needs to be entered, 
enter "N" for first question, uy" ror second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 9. шге ЕИ ALOE “desired out put 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter indices of variables to be used in 
test. Enter blocks or pairs to be excluded, i£ any. 

(2) Enter whether a constant term is to be added 
to the value of the higher indexed variable, Y. By default, 
Eme” constant is 0. 

(3) Enter the letter of the type of hypothesis 
and the hypothesised level of significance. 

4. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. 
Crunch will provide the sum of negative ranks and 


sum of positive ranks, as well as the test statistic Tin 
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all cases. If there are fewer than 20 pairs, table look up 
is required; otherwise, the program will compute the empir- 
ical level of significance. 

When the number of untied pairs is less than 20, the 
output will also contain the vector Dif, the differences 


Yi-Xi, in ranked order. 


Me ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation of Test Statistic/level 
of Significance 
a. Sums of the Negative and of the Positive Ranks 
Two sums are calculated, one for the negative 
differences-- those pairs in which  Xi»Yi--and one for the 
positive differences. Pairs in which Xi=Yi are not consid- 
ered. Subtract Xi fron Yi, to fora the vector Dif. Take 
the absolute value of Dif to form Adif. Rank the values of 
Adif tc form the vector Rank. Troyes, “vthe 1th value of 
Rank is added to the sum of negative ranks, then that ith 
value is multiplied by negative 1. IE Yy the jth value 
of Rank is added to the sum of positiv2 ranks. 
оперисао од Of Test Statistic 
The vector Rank was originally the ranks of the 
absolute differences; now, the ranks for the negative 
differences have been multiplied by -1. Sum the vector Rank 
Bemetaind the numerator of the test statistic. Sum the 
squares of the elements of Rank to find the denominator of 
the test statistic. Divide numerator py denominator. 
c. Level of Significance 
If the number of untied pairs is 20 or less, use 
Fre A-13 “ref 1], using the sum of positive ranks. 
Otherwise, Crunch computes an approximation, using the test 


ЕМЕ БЕС T and the standard normal distribution. 
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2% 


HS Ito 


rogram Method of Operation 

he subroutine Wilcox finds Ше ОТТ and Adif 
vectors, and calls the subroutine Ranque (which in turn 
eis Rng) tò find the vector Rank. Wilcox computes the 
sums cf the positive and negative ranks, as well as the test 
Bnatistic T. If the number of  untied pairs is 20 or less, 
it will display the sums of ranks. Otherwise, it will call 
the subroutine Normv to find the approximate empirical level 
af significance, and display whether to reject the null 
hypothesis. 


3. Program Characteristics 





a. Limitations 
Maximum number of data points: 400. Maximun 
number of variables: 2; maximum number of pairs: 200. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Wilcoxon 
(2) Crunch: Wilcox, Ranque,Rng,Norav 
(3 External: Shsort 
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XIV. NONPARAMETRIC CORRELATION 


A. | OVERVIEW 


1. General Purpcse 





Nith na pairs cf data elements, X and Y, +est whether 
th» value of X is statistically independent of the valus of 
Y. or if it varies directly or inversely with Y. 

Three different test statistics are available by 
running this one program: Pearson's rho, Spearman's rho, and 
Kendall's tau. 

2. Level of Msasurament 
arson's rho to have meaning. 
Ordinat for Speasuaan's rho and Kendall's tau. 
3. Assumptions 
X,Y are a random sample of population pairs. 
4. Hypothesis 
Where rho is a measure of cocreala+tion,-1<rho<=1 
а. Null hypothesis 
HO: rho(X,Y) = 0, a necszssary condition for 
statistical independence 


Ww 


b. Alternative hypothesis 
HI TRSKA 0 


B.  ACTICN REQUIRED EY USER OF PROGRAM 


Q 
Iri 
py 
іы 
[o 
e 
1 
O 
на | 
(Л 


r of variables from a data 


1. User Decisions about Pro 





The user chooses which pa 
set are used in a test, and whe 
pairs from the test. As із ѕсгірей balow, the computaticn of 


1 
ther to exclude anv of the 


Kendall's “tau uses an algorithm of complexity ¡eval 


n-squared (when comparing 2 very pair with every other pair); 





if there are more than 40 pairs Lochinvar offers the user 
ne chance to skip the Kendall's tau test. т ес, only 
Spearman's rho and Pearson's rho (which may have no meaning 
if the data is ordinal) will be computed. Otherwise, all 
three will be produced. 
2. Program Activation when Using Data Valuss 
a. Initial Choices 

See section III.C for a detailed explanation of 
ES following. 

m cM. type LOCHI: Tf data is already in a data 
file, enter "Y" to first question, then enter file nane; 
when offered a test, type 10. If data needs to be entered, 
enter "М" for first question, Vio second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 10. E O tv "ror "desired output 
file. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) If there are more than two variables in the 
data set, enter the indices of ths two to be used. Enter 
pairs to be excluded, if any. 

(2) If there are more than forty pairs, enter 
whether the Kendall's tau should be computed. 

4. Test Executicn 
Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available to the User 
Tre program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
Be. moarnen S Eine, Jearson!s’rnha, and Kendall's tau are 
listed. 
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Soe ACTIONS PERFORMED BY THE PROGRAN CRUNCH 


1. Description of Calculation of Test Statistics 
a. Pearson's Rho 
Pearson's rho is SXY divided by square root of 
SXX times SYY. The deviations of X ar2 found by subtracting 
the mean of X from each value of X; deviations cf Y are Yi 
minus mean of Y. The product of deviation of Xi times devi- 
Amon of Yi, summed over all pairs, is SXY. The sum of 
squared deviations of Xi is SXX; the sum of squared devia- 
Bons of Yl is SYY. 
b. Spearman's Rho 
Rank the values of X. Rank the values of Y. 
Those pairs of ranks (R(Xi),R(Yi)), are then used to compute 
EY divided by square root of SXX times SYY. That jis, 
Spearman's rho is Pearson's rho performed on the ranks of 
the values rather than the values themselves. 
C. Kendall's Tau 
Tau 15 roughly the measure cf the number of 
concordant pairs minus disconcordant pairs divided by total 
number of possible pairing. Concordance implies positive 
correlaticn; e.g., if pair of the data set number 4 (X4,Y4) 
has an X value greater than, say, X5, and a Y value greater 
Ban YS, then this is one small indication that there is 
positive correlation between X and Y. Therefore, the 4th 
and 5th pairs of the data set form to make one concordant 
pair. If X6 is greater than X4, but Y6 is less than X4, 
this is a little evidence of negative correlation; the 4th 
and 6th pairs of the set form to make one disconcordant 
pair. Ties in either X or Y mean that the comparison yields 
neither ccncordance nor disconcordance. 
To compute tau, order the pairs (Xi, Yi) in 
ascending order of X's. Compare the first pair (the one with 


the lowest X value) to every other pair. If X1 is tied with 
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Pore (wlth ey, gO tO nel mext pair; if not, Юл су) 
moles concordance, (l > yi) implies disconcordance; 
increment the relevant count. After the fire: pair is 


compared with all others, the second pair is compared with 


all succeeding pairs (3,8,...,k), incrementing counts as 
necessary. Each pair is compared to all succeeding pairs, 
until Xk-1 is compared t5 Xk. Kendall's tau equals the 


number of concordant pairs minus the number of disconcordant 
pairs, all divided by the total possible number of pairings 
(k (k-1)/2). саатла are concordant, tau will equal 
Ball disconcordant pairings will yield an -1. 
2. Program Method of Operation 
a. Pearson's and Spearman's Rhos 
The subroutine Rhoer calculates SXY, SKX, SYY for 
any input values, and returns--inter alia--rhe. For 
Pearson's rho use the values of the data set. POT 
Spearman's rho, use subroutine Rng t9 rank the values of che 
data set, and send these ranks to Rhoer. 
Far Kendallırs- Tau 
Since the values have already been ranked, use 
Mer indices э? ranked X to order the pairs (R(Xi) ,R (Yi)). 
Ranks work the same as values for finding concordance or 
disconcordarce, but ties are easiar to discern with ranks. 
With nested do for loops, compare tha ith pair (i = 1 to 
k-1) with pairs j = i+1 to k; R(Xi) = R(Xj) or R(Yi) R(Yj) 
Mans to continue with looping. Otherwise, if R(Yi) < R(Yj) 


then increment concordant counter, if R(Yi) > R(Y3j) then 
increment disconcordant counter. At the end of the nested 
loops, divide the difference of th2 concordant counter minus 
the disconcorlant counter by the term (k(k-1)/2). 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maximum 

number of pairs: 200. 

b. Subroutines/Procedures Used 


T3 





(ie Lochinvar: Correls 
(2) Сеше Correl;Rnqgq,Rhoaer 
(Eternal: Shsere 
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OVERVIEW 


Regression tests the effect of the value of 


dent variable X has on the value of the independent 


Y. 


ang as the Y portion cf the pair. given K, i 


B. 


indepen- 


variable 


The linear regression model assunes that *he effect of X 
en is linear, that is, E(YIX)sE(Y)*b1(X-E(X)). This test: 


(1) Finds estimates of bi. 
(2) Predicts values of Y given values of X 
(3) Test if the linear modsl is correct 


(4) Finds confidence limits for the values of bl 


2. Level of Measurement 


Ordinal, to test the linear model 





Interval, for the other purposes listed above 


3. Assumptions 


a. Radom sampling of pairs. X may be speci 
2 


s 1 
D. The 3»eawmod2al is the correct modal. 
4. Hypothesis 
Where b1 is slop? the paranester; b1* hypoth 
a. Null hypothesis 
НО: b1=b1* 
b. Alternative hypothesis 
Al: b1#b1* 


ACTICN REQUIRED EY USER OF PROGRAM 





ei. Püsemweseisions ı5sut Progran Opr!ans 
the 
pen 
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esized. 


be used, 
will 





automatically fit the least squares model, finding b1. The 
intercept parameter b0, E(Y]/]X) =b0+b1*X, is also computed. 

b. The user can input values for X to predict values 
of Y based on the least squares parame-ers. 

C. The user can specify a hypothesised b1* to test 
whether a linear model is appropriate. The default value is 
b1* = 0; this is the classical tes: of whether an indepen- 
dent variable predicts a dependent variable. 

а. The user can request confidence intervals on the 
slope parameter bl. As described below, the algorithm to 
compute, those intervals is of degree of complexity n 
squared, that is, fer n pairs(x,y), n(n-1)/2 comparisons 
must be made and stored to find the confidence intervals. 
For larger values of л, the user should consider not 
requesting this opticn. 

I£ the user does choose to have “the confidence 
intervals computed, the value W will be requested by 
Lochinvar. That value is explained by Conover [Ref. 1: р. 
267], and contained in table A-12 of that reference. 

2. Program Activation when Using Data Values 

a. Initial Choices 

See secticn III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data ís already in a data 
file, enter "Y" to first question, ‘then enter file name; 
when offered a test, type 11. If data needs to be entered, 
Enter "N" for first question, Чү! £965 Second : when data 
entered, choose to do a test; when offered a choice of 
tests, type 11. к ериш Ves. 4 2" for desired output 
Ple. 

b. Variable and Parameter Choice 

When prompted by Lochinvar: 
(1) Enter indices of variables +) be used in 


Er. Enter pairs to be excluded, if any. 
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(2) Enter values cf X to compute E(Y/|X) based on 
least squares linear fit. 
(3) Enter hypothesized slope parameter b1*. 
(4) Enter whether confidence limits are desired. 
3. Test Executicn 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Qutput Available to the User 
The program CRUNCH displays th? output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at che top of the out put 
je. The least squares fit parameters will be listed, and 
the Spearman's rho statistic for the specified value of bl. 
If the user requested expected values of Y for some values 
of X, the values of X and Y will be listed. Finally, if 
confidence bcunäs were requested, those will be given. 


C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


a. Find *he Regression Parameters b1,b0 


This step is identical to parametric simple 


regression. Find the means of X and Y, and the vectors of 
deviations (Xi-mean of X), (Yi-mean of Y). Multiply the 
corresponding deviations of X and Y together, and sum the 


Peogucts to forn SXI. Square the deviations of X, then sum 
those squares, to form SXX. Sum the squared deviations of Y 
to find SYY. The slope parameter b1 is SXY/SXX. The inter- 
cept parameter b0 is the mean of Y minus b1 times the mean 
En X. The expected value of Y given X is D0 plus b1 times 
Xç; Residuals are the difference between the expected and 
atual value of I. 
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БО е5 Е the аше OL X as a Predictor of Y 

In parametric regression, it is standard to test 
whether the slope parameter b1 could be 0 in the entire 
population. In this package, any value b1* can be set as 
the null hypothesis. The test checks to see if the resi- 
duals of the fit using b1* are correlated with the values of 
X; if not, then the fit is considered adequate. The resi- 
duals are (Yi-b1*Xi), the vector of residuals and the inde- 
pendent variable X are compaired using the Spearman's rho 
test. See section XIV for details əf that test. 

C. Predictions of Y based on Least Squares Fit 

With the least squares parameters bO and 1 
already computed, Crunch multiplies each given X times bl, 
adds bO, and displays the predicted Y. 

d. Confidence Bounds for tha Slope Parameter b1 

Form the vector of slopes S.  S(1) is (X(1)-X(2)) 
ВЕУ аеа by (¥(1)-¥(2)); S(2) is (X(1) -X (3))/(Y (1) -Y(3)) , and 
so on. With k pairs (X,Y), there are N=k(k-1)/2 elements of 
the slope vector, when comparing sach i with each other j 
once. 

Sone tne Weetor S. The indices of the confidence 
limits are r (0.5* (N-W)) and s (0.5*(NtW) +1), where N is the 
total number əf slopes and W is the user entered value from 
table a-1A of Conover. If necessary, round r or s up to 
integers. The lower limit on b1 is S(r), the upper limit is 
5 (5) . 

2. Program “Method of Operation 

The subroutine Regret calls the subroutine Rhoer to 
find the least squares parameters b1 and b0. Regret figures 
the residuals for the specified b1*, and sends the residuals 
to subroutine Correl to have Spearman's rho calculated. 
Regret figures the vector »9f slopes S; that vector is sorted 
by Shsort; Regret finds indices s and r and the coniidence 
mitis to b1. 
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3. 


ity 


rogram Characteristics 





a. Limitations 
Maximum number of data points: 400. Maximum 
number of variables: 2. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Regret 
(2) Crünchni Regret, Rhoer, Correl 
(3) External: Shsort 
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A. OVERVIEW 


1. General Purpose 
With independent variable X, and dependent variable 
Y, find the E(YI|X) equation, based on a least squares fi* of 
the ranks of X and Y. Linear interpolation finds values from 
EE cL E(rank of.X) to E(rank of Y) to S(Y). 
2. Level of Measurement 
Interval 
3. Assumotions 
a. The sample is a random sanple 
b. The regression of Y on X is monotonic. 
4. Hypothesis 
This test does not produce atest statistic; the 
assumption is that the monotonic moi3l is correct. Instead, 
it will predict expected values of Y for given values of Y, 
Miso, it can produce a trace of tha monotona regression, 
which'is aia lo gous to the straight line of tha least squares 


fit of linear regression. 


ШОО ACTICN REQUIRED PY USER OF PROGRAM 


1. User Decisions about Progran Options 
The user must specify the index for the independent 
value X, and dependent variable Y. Tha user can include 


values of X to have predicted a corresponding value of Y. 
The user can request e trace of the monotone regression. 
2. Program Activation 


a Initial Choices 


sS SE S On ITIT.C for initial choice details. 
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Ia CMS, туре LOCHI. If data is aiready in a deta 


file, enter "Y" to first question, then enter file name; 
when offered à test, type 12. If data needs to be entered, 
enter "N" for first question, (“urco Second; when data 


entered, choose to do a test; when offered a choice of 
tests, type 12. пер ОНЕ ИСС ЧАИ for desired output 
file. 
b. Variable and Parameter Choice 
When prompted by Lochinvar: 
(1) Enter indices of variables X and Y. Er ter 
pairs to be excluded, if any. 
(2) Enter whether to predict values cf Y for 
specified values of X. If yes, enter values of X. 
(3) Enter whether a traca is desired. 
3. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Output Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. In all cases, there will рг listed the parameters bO 
and bi, the intercept and slope of the least squares fit on 
the ranks of X and Y. ihe the uses specified X's to predict 
Y's, the predicted values will ba listed. Finally, t a 


trace was requested, the pairs of X's and Y's will be given. 


C. ACTIONS PERFORMED BY THE PRCGRAM CRUNCH 


1. Description of Calculations 
a. Least Squares Fit on Ranks of X and Y 
Sort and rank X and Y. With n pairs, X(i) and 
Y(j) are the order statistics, aX(i) and RY(j) are the 


ШКЕ, i-z1,...,n, hs1,...,n. Compute the least squares fit 
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БО їп to and bl, such that E(RYIRX) =bO+b1*RX. See section 
Xv for details of least square computations. 
b. Predicaticn of Y Based on Values of X 

Linear regression will extrapolate values of Y 
Bomea given value of X i£ that X is outside the range of ths 
cbserved values X(1), ...,  X(n). Monotone regression will 
fer extrapolate; if the user specifies an X outside the 
range of the observations, that X will not yield an expected 
Y. 

Let Xp be one of the values the user specified to 


Bredict a Y, Yp. Find the pair of values of the order 
Ea tistics Of X which bracket Xp, X (i) <Xp<=X (i+1). The 
expected rank of Xp, RXp, must fall between those order 
statistics! ranks, RX(1) S<RX p<=RX (itl). Use linear interpo- 


ШЕТ оп to find  RXp: RXp= RX (i) (Хр-Х (1) ) / (Х (1+1) -Х(1)) 
* (RX (1+1) -RX(1)). 

Use the parameters bO and bi to find RYp, the 
mak of Yp: RYp=bO+bi*RXp. 

Find the ranks от Үү which bDrackot RID; 
BR) SRYP<=RY(J+1) . The value of Yp lies between Y(j) and 
+1). oemi mear interpolation to find Yp: Yp= Y{j) + 
EEO RY(j)) /Z@Y(3j+1)-RY(j)) *(Y(j+1)-Y(j))- 

C. Trace of Monotone Regression 

The trace isa set of linəs, between a sequence 
of points (Xi,Yi), which describe the monotone regression. 
When plotted, the trace can be used to predict a value of Y 
Ban X. This is similar to using the straight line of 
linear regression, Y=b0 + b1*X, for prediction. 

nd ne tendpornts ot the trace, use X(1) and 
X(n), and the procedure of part b, to find E(YIX(1)) and 
ИХ (n) ). 

Intermediate points of the trace will use the 
values of Y to predict expected X. Zach value of Y has a 


ЕСИ BY, which is used with b0 and b1 to find RXt (trace), 
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RXt= 


bracket it. 


(RY-E0)/b1. 


КОТ ОП ЕХ, find the peir of RX*s which 


Use linear interpolation to find Xt. 





2. Program Method of Operation 
The subroutine Monore calls Rnq to rank and sort 
values. The subroutine Correl finds the least squares 
parameters b0 and b1 from the rank vectors RX and RY. The 
subroutine ESTY performs estimations of Y for given X's, 
calling Rtov (which calls Interp) t» do the linear interpo- 
lations. The subroutine Trace performs the trace, calling 
or, Rtov and Interr. 
3. Program Characteristics 
an Linitations 
Maximum number of data points: 400. Maximum 
nunber of variables: 2. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Monoreg 
O unch Monor ROY; Veor, Esty, Interp, Rang 
(3) External: Shsort 
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XXVII. MEDIAN TEST 


Be OVERVIEW 


1. 


me all 


Purpose 


With two or more samples, this test will determine 


the populations from which the samples were drawn 


could have tha same median. 


As described below, һе package allows combining of 


data set variables into composite variables in order *o *es-* 


populaticn groups for common median. 


2. 


3% 


The madian шау be a surrogate for expected value. 


Level of Measurement 


Ordinal 
Assumptions 


The variable were randomly irawn from the respective 


populations, and are mutually independent. 


4. 


nith k popúulations, AZ ERA Viand. wohne 


median denoted as M, 


a. Null hypothesis 
HO: M (X17) =M(X2) =... =M (Xk) 
b. Alternative hypothesis 
Н1: М (Хі) #М (Х1) for at least one of the pairs 


В. ACTICN REQUIRED EY USER OF PROGRAM 


1. 


include 





Aggregate variables may bea formed by having Crunch 


a 
combine «he expected cell counts of several variables into 


84 





on2 or more aggregate variables. For example, let data set 
Farmland contain six variable, in order, A through F. 
Variable A is corn yield per acre of farm land treated with 
fertilizer and pesticide. pana Orn yl elimi Eh fertilizer 
Mee no pesticide. C is corn yield with pesticide but no 


fertilizer. D is corn yield per acre from land untreated by 


either fertilizer or pesticide. Variables E and F have 
other values. Variables À and C could be combined into 
aggregate variable 1, and Band D into 2, to test whether 


the median yield with pesticide is squal to the median yield 
without pesticide. Sima cian a В, znd Cand D, could 
be combined into aggregate variables 1 and 2, to test the 
median of yields with and without fertilizer. 
Proper plannina and labeling of the variables will 
facilitv this aggregation. 
2. Program Activation when Using Data Values 
a. Initial Choices 
See secticn III.C for a detailed explanation of 
the following. 
Fis a De LOCHI; It data is already in a data 
Meee, enter "Y" to first question, then enter file папе; 
when offered a test, type 13. I= data needs to be entered, 
enter "М" for first question, М ЕЭ весовая: when data 
zo 


entered, choose 


+ 


do a test; when offered a chol ca of 


~ 
= 
-r 


ES. type 13. Picea Ves a Tor desired output 
pnl 


о 


b. Variable and Parameter Choice 
When prompted by Lochinvar: 
(ПОО ег? лаал сеш ОТ Varlables to bs used in 
fest. Enter blocks or pairs to be axcluded, if any. Enter 
whether *o us? aggregate variables. 
CUT т 


ces Of Variables *o be used in the test 


(0 
1 
ct 
р 
3 


£ aggreaata variables are nor used, 
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(2) If aggregates ar2 used, Lochinvar will 
Beompt for a choice on each data set variable, = J tono 
number of variables in the data set. ТЕ theo n data 
variable is to be excluded from the test, enter 0. It it is 
Кере includel, enter th2 index of the aggregate variaols 
into which it should be combined. For exampie, with “he 
Farmland data set described above, Lochinvar would prompt 
user choices on data set variable i= 1 to 6. Let the tas- 
oMa comparison oof yields with or without pesticide, ‘thus 
variables A and C into aggregate variable 1, B and D into 2, 
and E and F excluded. When prompted for zhe data Set var 1, 
énter 1. For data set var 2, Enter 2. For data set var 3, 
enter 1. Por data set var 4, entar 2. For data set var 5 
and 6, enter 0. 
4. Test Execution 

Tc perform the test, after the above entry of data 

and parameters, type CRUNCH while in CMS. 
5. Output Available to the User 

The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
fae. ENS progran will print the test statistic and the 
degrees of freedom, as well as a two by number of variables 
contingency table, showing how many elements of each vari- 
able are less than or equal to the grand median of all vari- 


ables, and how many are greater than. 


See ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


Desert ion of Calculation of Test Statistic 
calcula tion Of the Cell coun<s of the M Matrix 
The M matrix has two rows and a column for each 
variable in the test. The A row has the number of elements 


in each variable less than or equal to the grand median; the 
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B row has the number of elements greater than the grand 
median. The values of all the variables are copisd into a 
vector, which is sorted; the median of all those values is 
the grand median. fne counts orl the calls of Zhe matrix M 
are set to 0. Each element of the first variable is conpared 
to the grand median; if less than or equal, the A row first 
column cell count is incremented, otherwise increment B гом 
meest column cell count. After all elements of the first 
variable are compared to the grand mean, do tha second vari- 
aoe, incrementing the 2 column's cell count, et cetera. 
b. Conmpikeamg the Test Statistic 

сыш а ccunts of the A row to find Atot, and all 
Buts of B to find Bror. Sum Atot and Brot to find N. Sum 
BUS first column counts to find NAW (1), the second column 
counts to find NAV(2), 3st cetera. The expected cell count 
for cell aj is Atot*NAV(j)/N; for bj, Btot*NAV(j)/N. The 
tase is іп essence the chi-square test for independence (r*c 
Bemtingency table), with the test statistic as the sum over 


each cell of observed minus expected count squared, divided 


by the cell expected count. The actual aquations used 
Шр ОС1< the structure of the probien. The degrees of 
freedcm are (r-1)(c-1) = number of variables minus one. 


2. Program Method of Qperation 

a. Aggregation of Variable 
The subroutine Media assumes that aggregate vari- 
ables are always used. when not, the program uses k aggre- 
gate variables where k equals the number of data set 
variables included in the test; aggregate variable 1 is data 
set variable 1,...,aggregate variable k is data set variable 
К. When aggregates are in fact specified, their valuss, 
ED. ained ià ZO vector, will be usei. In ei*her casa, “¿he 
vector AGVARI will contain the indices relating each data 


se* variable to an aggregate variabls. 
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Decco: counts 
Tho pa r e M 1swastıctinn,. The vector NA is the 
count of elements less than or equal to the grand median; 
it is arranged according to aggregate variable indices. NB 
is the count »f elements greater than the grand median. In 
Bio fer loop i=1 to the number of data set variables, k is 
Set to AGVARI(i), and each element of the ith data set varl- 
able is compared to the grand median. If the element is 
less than or equal to the grand aedian, increment NA (k) 
(NÀ(AGVARI(i)), otherwise, increment NB(k). 
C. Compute the Test Statistic 
NA(i) and NB(i) are sumaed to find NAV (i). All 
elements of NA are summed to find A, all elements of NB are 
Filed to find В. A plus 3 equals N. The test statistic is 
computed using the specialized equations of Conover {ref 1, 
р. 172]. The degrees of freedom are the number of aggregate 
variables minus one. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maximum 
number of variables or aggregate variabies: 15. 
b. Subrzoutines/Procedures Used 
(1) Lochinvar: Median, Ag med,Which, vec,None ex 
(2 Crunch: Media 
(3 External: Shsort 
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ТОЕ ЕО КАИ ЯАГЬТ5 TEST 


no OVERVIEW 


1. General Purpose 

With a data set of nore than two  variablss, 
X1,X2,...,Xk, test whether the expected values of all vari- 
ables are equal. 

ТЕКЕ 'c null hvpoths5sis 2s rejected,  *hen it 
Memopeassible €5 determine which of the pairs of variables 
have an unequal mean. 

If the data set is blocked, consider instead of this 
test the Cochran (ncminal) , Frieiman (ordinal) or Quade 
meerval) «ests. If paired, consider the Sign (ordinal) or 
oxcn (interval) tests 

2. Level o£ Measursmuen 

Ordinal 


3. Assumptions 


|, 





a. Fach variable is a random sample from within its 
populaticn. 

b. The sampling was mutually independen* among the 
populations. 

C. The population distributions are identical, or 


alse some of the populations tend tə have higher values than 


others. 
4. Hypothesis 
а. Null hypothesis 


HO: E(X1) E(X2) 5... E(Xk) 
b. Alternative hypothesis 
Hi: Does for at Least one Hair of popula- 
tions represented by the data sat. 
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Ban CL LGN REQUIRED EY USER OF PROGRAM 


pzions 


=a a 


1. User Decisions abouz Progran 
The user must specify which variables of a data set 
are included in the test 
After the test is run, Crunch will present the 
overall test statistic, and a series of comparisons between 
the variables. If, and only if, the null hypothesis has 
been rejected by the overall test statistic may the user 
infer any information by the comparisons between the pairs 
NX) of variables. 
2. Program Activation when Using Data Values 
a. Initial Chcices 

See сес са III.G for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 14. If data needs to be entered, 
enter "N" for first question, wy" for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 14. Boece a a Еос desired output 
ple. 

b. Variable and Parameter Choice 
When prompted by Lochinvar, enter indices of 
variables to be used in test. 
3. Tesi Execution 
To perform the test, after the above entry of data 
while in CMS. 


4. Output Ayailabie io the Ussr 


= 


and parameters, type CRUNC 


The program CRUNCH displays the output file to which 
the data has been sent, 2.9., "U LISTING". The names of the 
data set and variables used are at the top of the output 
pole. The overall test statistic will be given. А150 


listed will be comparisons of pairs of variables; these 
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comparisons only have meaning if the null hypothesis nas 
been rejected. The paired comparison are essentially at 
test run on the ranks of the two variables being compared, 
testing the hypothesis that their msan ranks are the sane. 
Crunch supplies the absolute difference in their mean ranks 
and the pooled standard deviation; the user must find the 
Eoropriate value of t to conduct the = test. 


СО ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


1. Calculation of Test Statistic Zar the Null 
Hypothesis 
a. Find the Sums of Variable Ranks Ri 
Rank all values in the data set. Sum the ranks 


in the ith variable to get the value Ri. The number of data 
elements in the ith variable is ni, so the average rank in 
Be ith variable is Ri/ni. The sum of ni over all k vari- 
ables is N. 
БОЕ: ПЛ КЫ Тест Statistic 

The test statistic numerator is the sum over k 
treatments of (Ri**2)/ni, minus ((N+#1)**2) «Миц. To find the 
E statistic denominator, 52, first sum over all data 
elements the square of the ranks. Subtract from this sum of 
squares the term ((N+1) **2) *N/4). Divide that difference 
by (N-1) to get S2. Davede the numerator by S2 to find the 
test statistic T. 

C. Make Comparisons of the Pairs of Variables 

Às mentioned above, this step is the student + 
test on the ranks of two variables to see if their popula- 
tion mean could be equal; it means nothing if the null 
hypothesis nas not been rejected. 

Her each pair of vaziables Ki,Xj, find the abso- 
lute difference in mean ranks,  |Ri/ni-Rj/njl. Find the 
moled Variance, (S2*(N-1-T)/(N-k)) times (1/ni*1/nj). The 
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ps 


pooled standard deviation is the square root of the poole 
h 


pooled standard deviation times the desired value of <, 


ct 
(D 


variance. If the absolute difference is greater than 


reject the assertion that the mean of populations 1 ала j 
are the same. 
2. Program Method of Operation 
The subrout 
values of the data set. In a nested do for loop, i=1 to k, 


ine Kruwal calls Ranque to rank the 


j=1 to ni, the intermediate terms of the statitics are 
computed. Kruwal prints the overall test statistic to the 
specified output file. It then makes pairwise listings of 
the absolute difference of mean ranks and the corresponding 
pooled standard deviations. 


3. Program Characteristics 





a. Linitations 
Maximum number of data points: 400. Maximum 
number of variables/treatments: 15. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Kruwal 
(2) Crunch: Kruwal, Rangue 
(3) External: Shsort 
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AS OVERVIEW 
1. General Purpose 

With two or more treatments i a data “Set, “Ses: 
whether the population variances of those treatments аге 
equa_. 

2. Level of Ysasursment 

Interval 

3. Assumptions: 

Randon sampling within each treatment pvonula+ion, 
and among +trəatment populations. Level of measurement is 
interval. 

4. Hypothesss 
a. Two tailed test, where X,Y are treatments 
(1) Null hypothesis hO: var(X)=var (Y) 
(2) Alternativ= hypothesis h1: var (X) #Zvar(Y) 
b. Опе tailed test, two treatments 
(1) Null hypothesis hO: var (X) >=var (Y) 
(2) Alternative hypothesis hi: var(X)<var (Y) 
с. Опе tailed test, two treatments 
(1) Null hypo*hesis h0: var (X)=<var (Y) 
(2) Alternativa hypoth2sis һ1: var(X)>var(Y) 
d. Nunber of treatments (Xi) are three or mors 
(1) Null hypothesis hd: var(X1)=var (X2) = 
eee =Var (Xk) 
(2) Alternative hypothesis hi: var(Xijp#var (Xj), 
meee some 1,1, indices of the data set 
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POS ACTTICN REQUTRED BY USER OF PROGRA 


1. User Decisions about Program ODt29nsS 
The program allows the usar to choose which treat- 
ments to include in the test. 
2. Program Activation when Using Data Values 
a. Initial Choices 
See section III.C for a detailed explanation of 


mue following. 
US Ay pS OS HE Тї data 1s already in a data 
MiS. enter "Y" to first question, then senter file name; 
when offered a test, type 15. If data needs to be entered, 
emer "ит for first question, EEST second: when data 
entered, choose to do a test; when offered a choice of 
tests, type 15. гооп "Шун UA for desired output 
file. 
b. Variable and Parameter Thoice 
When prompted by Lochinvar Enter indices of 
variables to be used in test. 
Y. Test Execution 
Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while ino CMS. 
5. Output Available to ihe Ussr 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables appear on the top of the output file. 
Also included are the test statistic and the number of 


treatments used *o compute the statistic. 


94 





C. ACTICNS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation of Test Statistic (11/12) 
a. Find the ranks of each absolute deviation 
Compute the sample mean for each of k treatments. 
Subzrac- the treatment mean from each of the treatment's 
elements. Take absolute values of each of those differ- 


ences. Rank all N absolute differ2nces (N=total elements in 
data set). 
b. Complete intermediate statistics: 
(1) SUMR, the sum of all Tanks. 
(INSTA Ene SUN Of the square of ali ranks. 
(3) SUMR4, the sum of ali the fourth power of 
each rank. 
(4) SUMRJ, he k element vector with the sum of 
the ranks for each treatment. 
(5) SUMR2J, the k element vector with the sum of 
squared ranks for each treatment. 
(6) R2MU, the average squared rank: SUMR2/N. 
(7) RMU, the average rank: SUMR/N. 
C. For two treatments compute the statistic T1 
(1) Numerator: (SUMR2J (1) -n1*R2MU) 
(2) Denominatior: SUMRU*ni*n2/(N*(N-1)), minus 
(RMU**2) *n1*n2/ (N-1) 
(3) Ti=numerator/denoninator 
d. For three or more treatments compute T2 
(1) Numerator: difference between these two 
terms, sum over `k treatments of (SUMR2J**2) /ni, minus 
(R2MU**2) *N 
(2) Denominator: (SUMRY-N*(R2MU**2)) / (N- 1) 
(3) T2=numerator/denominator 
2. Program Operations 
The progzam uses subroutine  MUVAR to find treatment 
means. The subroutine Squark finds th2 absolute deviations; 
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while the subroutine Rnq ranks then. Dhezsypzoutire Squark 
Uses nest "do for" loops, i=1 to k (treatments) and j=1 to 
ni (elements in treatment), to caluculate the vectors SUMRJ 
and SUMR2J, and the sums SUMR, SUMR2,  SUMRA. RMU and R2MU 
are found by dividing SUMR and SUMR2 by М. Depending on the 
number of treatments, T1 or T2 is found by simple arithmetic 
using the intermediate statistics. 
3. Program Characteristics 
a. Linitations 
Maximum number of data =12ments is 400. Maximum 
number of treatments is 15. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Squarank, which_vec 
(cC ronch:bSquarkyWuqar, Riq 
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A. OVERVIEW 


Test for equal variance among the к underlying popu- 
lations of the variables or treatnants, reflected by the 
values for those treatments in the data set. 

2. Level of Measurement 

Interval 

3. Asssumptions 
Random sampling within each treatment, independence 
of sampling anong treatments. Interval level of measurement 
4. Hypothesis 
a. Null hypothesis 
HO: var (X1)=var (X2) =...=var(Xk), whare k= number 
of treatments 
b. Alternativa hypothesis 
H1: var (Xi)#var(Xj) for at least ona pair 


В. ACTION REQUIRED EY USER OF PROGRAM 


1. User Decisions about Program Options 


FF| шс meh mm Sm rn s s SS qu. чы ыр «u aw = слын mw man d af чыл шы <s 


The program allows the user to choose which treat- 
ment 


5 
2. Program Activation 


Onne demin the "sat, 
N 


a. Initial Choices 
Se-EcICCtoconm PETPSC for initial choice details. 
а 


In CMS, *voe LOCHI. If gata is already in data 
Bere, enter "Y" to first question, then enter file name; 
when offered 2 test, type 16. If data needs to be entered, 
enter "N" for first question, uc" second: when data 
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entered, choose to do a test; when offered a choice of 
tests, type 16. PHS UD for desired ou: put 
file. 
b. Variable and Parameter Choice 
When prompted by Lochinvar, Enter indices of 
variables to be used in test. 
4. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available to the Ussr 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
Erle. The there are the maximum and minimum variances and 
indices of the associated treatments. Finally, there is the 
WWstatistic, to be compared to a table prepared for the 


Hartley test. 


fee ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation of Iest Siatistic 
Crunch does the following: Compute the sample vari- 
ance for each treatment. Find the smallest and largest 
variance. Divide the largest variance by the smallest vari- 
ance to yield the test statistic Р. Compare the test 
statistic to a special table to see if the null hypothesis 
should be rejected. 


2. Program Method oí Ope 


ry 


ration 

The treatment variances are figured in the subrou- 
tine MUVAR. The subroutine HARTLE scans the variances to 
find the largest and the smallest, divides the former by che 


latter, and gives that number as tha test statistic F. 
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Ing 


rogram Characteristics 





a. Limitations 

The maximum number of data elements is 400; the 
number of treatments 15 15. 
b. Subroutine/Procedures Invoived 

(1) Lochinvar: Hartley, Which, vec 

(2 Crunch: Hartle,Muvar 

(3) External: Shsort 
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XXI. FRIEDMAN TEST AND DURBIN TEST 


A. OVERVIEW 


1. General Purpose 
To perforn a treatment and block test, similar to 
ANOVA, using only ordinal da*a. In fact, “the data may be 
rankable only within each block; no v по relationship 


between data values in different blocks is necsssary. 
Ohe oe sy ts fOr data Sets sf complete 
block design, i.e., each treatment is applied to each block. 
The Durbin test is a generalized Friedman *es+, for 
use with incomplete block design. With t treatments and b 
blocks, each treatment is applied tor blocks (r<b), and 
each block receives k treatments (k<t). The user must guar- 
antee that the design is balanced, and insert a filler value 
in place of missing data values when entering the data set. 
Each test will generata a sat of differences 
between pairs of treatment summed ranks. . If, and only if, 
the null hypothesis is rejected, these differences may be 
u Sd, in conjunction with à standari deviation value and the 
student's t distribution, to determine íf thera is a signif- 
icant different between two treatments. 
2. Level of Measurement 
Ordinal within sach block. 
3. Assumptions 
Values within blocks are inispenden* among blocks. 
4. Hypothesis 
a. Null hypothesis 
HO: Each possible ranking within the block is 
equally likely (that is, the treatment effects are equal) 
b. Alternative hypothesis 
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H1: At least one of the treatments tends to ge- 


higher ranks than at least one other treatment. 


B. C TICN REOUTRED EY USER OF PROGRAM 


1. User Decisions about Program Options 
The user must decide which treatments and blocks to 
include in the test. ІЁ а incomplete block design is to be 
used, the user must specifically enter the data sec for this 
purpose: there will be t variables, and b elements entered 
for each variable. фп осе эше Биен have “a no data 
value (a treatment which was not applied to some block), 
enter a filler value. In a data sat, there can b2 only one 
ANS: value, and it must be differant than any of the data 
values. For example, if all the data values are positive, 
-1 could be used as a filler. 
2. Program Activation when Using Data Valuss 
а. Initial Choices 
See secticn III.C for a detailed explanation of 
the following. 
Intens, type LOCHI. If ilata is already in a data 


file, enter "Y" to first question, then enter file папе; 
when cffered a test, type 17. If data needs to be entered, 
Eur "N" for first question, IN ES E Second; when data 


entered, choose to do a test; when offered a choice of 
tests, type 17. Pies О ТИИС НАН сос “desired out put 
mele. 
b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter whether to use the Friedman or Durbin 
test. 

(2) If the Friedman test is chosen, enter the 
treatments to be used and any blocks to be excluded. 
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(2) ОТО ene Durbin test is chosen, enter the 
treatments to be used, any blocks tə be excluded, k= number 
of treatments applied to each block, r= number of blocks to 
which each treatment is applied, and the filler value. 

4. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Availabl2 to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data sex and variables used are at the top of the output 
Ele. The overall tes: statistic for the null hypothesis 
will be given. In addition, the difference between pairs of 
treatments! summed ranks will be listed; these are mean- 


ingful only if the null hypothesis is rejected. 


meee ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


me escent ion of Calculation of 


i3 


2st Statistic 
а. Use of Filler, If Durbin Test 
If the Dimer cest is indicated, find the maximun 
value in the data set. Add 1 to that maximum value *o form 
Met. In a do for loop, substitute MP1 for the filler value, 
and create the integer vector INTEST. INTEST(i) equals 1 
if the ith element in the data set is not the filler value, 
0 if the ith element is a filler. 
b. Rank Values Block by Block 
In both tests, find the ranks of the values of 
the data set block by block. Load those ranks into the R 
vector. 
In th2 Durbin test, the filler elements will have 
a higher rank than these of data elaments, since the fillers 
were loaded with MP1. Therefore, after all the ranking is 
Euolstoe, multiply the rank vector -imes INTEST; this sets 
the ranks of all fillər elements to 0. 
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C. Suns of Treatment Ranks and Total Sums 
Square each rank of tha rank vector R, and sun 
those squares over all treatments and blocks to form A2. Sum 
+he ranks of each treatment to form the vector ЕЈ;  RJ(j) 
contains the sum of ranks of the jth treatment. 
d. Test Statistic for the Friedman Test 
Square each element of the RJ matrix; sum those 
Squares; divide that sum of squares by the number of blocks 
to find B2. The dencminator of the test statistic is A2-B2. 
Let b be the number of blocks, t the number of treatments. 
mimes numerator of thé test statistic is (b~t) * (B2 -0.25bk* 
(k+1)**2). Divide the numerator by the denominator to get 
Bus test statistic Т2. Compare Т2 <O the F distribution 
with (k-1) and (k-1) (b-1) degrees of freedom 
e. Test Statistic for the Duzbin Test 
Sun the squares of RJ. Multiply that süm of 
squares time 12(t-1). Divide that product by rt (k-1) (kr), 
where r is the number of blocks to which each treatment is 
applied, and k is the number of treatments applied to each 
block. This quotient is the first term of the test 
statistic. The second tern is 3r(t-1) (k+r1) divided by 
(k-1). Subtract the second term from the first term to find 
the test statistic T. T is compared to the chi-square 
distribution with t-1 degrees of freedon. 
f. Paired Comparisons of Treatments 
Crunch will print the following numbers in all 
cases, but they only have meaning when the null hypothesis 
is rejected. For each i,j of the treatments, the absolute 
difference between the sums of treatment ranks is listed. 
The number that is ecuivalent to th2 standard deviation is 
also computed. For absolute differences greater than the 
tandard deviation times the t quantile (1-alpha/2), the 
treatments can be considered to have a different effect. 
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2. Program Method of Operation 
т 


tions. The subroutine Ranque ranks the values 
block. The subroutine Parcom does the comparisons 
ment sums of ranks. 

3. Program Characteristics 


a. Limitations 





Maximum number of data points: 400. 
number of variables/treatments: 15. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Friedman 
(2 Crunch: Friedm, Ranque, Rng, 
(3 External: Shsort 


Parcom 
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A. OVERVIEW 


(D 


M. eneral Purpos 


= IM 


ith interval data organized into t treatments and b 
blocks, test whether all treatments have the same effect. 
If not, treatnents can be compared for differences in pairs. 
2. level of Measurement 
Interval 
3. Assumptions 
a. The values of sach blocks are independent of the 
values of the other blocks. 
b. The sampling within each block can yield a range 
of the elements; that range must b2 rankable among blocks 
4. Hypothesis | 
a. Null hypothesis 
HO: Each possible ranking within the block is 
equally likely (that is, the treatment effects are equal) 
b. Alternative hypothesis | 
H1: At least one cf the treatments tends to get 
higher ranks than at least one othar treatment. 


В. ACTION REQUIRED BY USER OF PROSRAM 


1. User Decisions abou 
d 
e 


he treatments to be included 


hed 
+ 


in the test, and whether any blocks are to be excluded. 
Crunch will provide the absolute differerce between pairs of 
treatments; if and only if the null hypothesis is rejected, 
these comparisons can be used *o test whether the effects of 


the pairs are equal. 
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2. Program Activation when Using Data Values 
a. Initial Choices 
Sea section ІІІ.С for a detailed explanation of 
the following. 
In CMS, type LOCHI. If data is already ina data 
file, enter "Y" to first question, then enter file name; 


when offered a test, type 18. If data needs +o be entered, 
enter "N" for first question, uy" for second; enter the 
data: when offered a choice of tests, type 18. Enter 
"jJ" "y" HZ for desired output fils. 
b. Variable and Parameter Choice 
When prompted by Lochinvar: Enter indices of 


variables to be used in test. 
3. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Output Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at che top of the output 
file. The test statistic T and its degrees of freedom are 
listed. Finally, the absolute difference between the pairs 
of sums of treatment ranks are given. These differences, 
when used with a standard deviation computed by Crunch and 
the t distribution, can be used tə check for equal effect 
between pairs of treatments. 


C. ACTICNS PERFORMED BY THE PROGRAM CRUNCH 


1. 


іо 


escription cf Calculation of Iest Statistic 
a. Rankings within and among Blocks 

Rank each element in the block; assign the 

elements to the matrix of ranks R. Subtract the largest 


element from the smallest element; that becomes the block's 
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range, stored in the vector R. After all blocks have had 
their elements ranked, rank the vector R; those ranks form 
the vector Q. 
De Thensizs Matrix 5 
With t equal to the number of treatments, the 
average rank of Ris (t+1)/2. Subtract that average rank 
from each element of R. Multiply the reduced ranks of the 
first block by Q(1), the reduced ranks of the second block 
by Q(2), et cetera, tc form the matrix S. 
с. Sums Of the Treatment Elements of S 
Sum the elements of the jth treatment cf S over 
elie blocks to form the jth element of the vector SJ. Square 
each element of SJ, sum those squares over all treatment, 
End divide by'b, the number of blocks, to find B1. Square 
all elements of S, and sum all those squares *o find Al. 
de fest Statistic 
The test statistic T1 is (b-1) time B1 divided by 
the quantity A1 minus B1. Compare T1 to an F statistic with 
degrees of freedom (t-1) and (b-1) (t-1). 
e. Pairwise Comparisons 
These values have meaning only if the null 
hypothesis is rejected. Compute the absolute difference 
between each pair of the vector SJ. If that difference is 
greater than th2 student's + quantile times the computed 
standard deviation, the pair of treatments can be said to 
have a different effect. The standard deviation is found by 
taking the square root of (2b(A1-B1)/ (b-1) (t-1)). 
2. Program Method of Operation 
The subroutine Quade calls Ranque to find the matrix 
Капа the vector Q. The test statistic is computed in 
Quade, as is the standard deviation. The subroutine Parcon 


makes the pairwise ccmparison. 
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3. Program Characteristics 


a. Linitations 


Maximum number of data points: 400. “Maximum 
number of variables/treatments: 15. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Quade 
(2 Crunch: Quade, Ranque, Eng» 
(ЗО External: Sasozt 


Parcom 
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А. OVERVIEW 


1. General Purpose 
With a data set of b blocks, to which k treatments 
have been applied, test whether the treatments have been 
equally effective. "Effective" must be reducible to two 
categories, mutually exclusive and collectively exhuastive. 
Here, these categories will called success and failure. 
Rio ДОЛ S not rəducipbple in this fashion, the 
Friedman (ordinal) or Quads (interval) tests are available. 
2. Level of Measurement 
Nominal 
3. Assumptions 
The blocks are a random sample from among all blocks 
4. Hypothesis 
а. Null hypothesis 
HO: The treatments are equally effective 
b. Alternative hypothesis 
Hl: There is à difference in effectiveness among 


th» groups. 


B. ACTION REQUIRED PY USER OF PROGRAM 


1. User Decisions about Progran Options 


The user must decide the standard for success or 
т 


failure of each treatment. If the data is nominal, the user 


Should enter cell counts. See paragraph 2, below. If the 
data is ordinal, tne user may enter a partition value for 
each treatment of a data set; see paragraph 3. [CDS 


latter method is used, Crunch will classify each value of an 
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entered data set as a success or failure by its relationship 
to the Partition value for its treatment. 
par erronea lides are us>d in conjunction with a 
data set, the user may specify which variables of the data 
set are to be included in the test, and may designate blocks 
to be excluded. 
2. Program Activation when Using Cell Counts 
a. Initiaticn and test selaction 
a De LOCHT. “Enter "NU for no to question 
about previously entered data. Enter sales: or cell counts. 
Er owe, ty"...7" for desired output file. When offered a 
choice of tests, enter 19. 
Dcerrocount entry 
Lochinvar will ask how many treatments and how 
many blocks are included in the test. After those numbers 
ar2 entered, Lochinvar will ask whether block 1, treatment 1 
was a success. Enter yes or n0. The program will prompt 
for each treatment in block 1, then move to block 2, and so 
rerh, through the b*k choices. The matrix of 1's 
(successes) and O's (failures) will be displayed, and the 
user has the chance to correct the values for any block (the 
program will prompt fer entries of sach treatment of a spec- 
med block). 
3. Program Activation when Using Data Value 


{Л 


a. Initial Choices 

See section III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 19. If data needs to be entered, 
БИЕ Чч for first question, INES second: when data 
entered, chose to do a test; when offered a choice of 
tests, type 19. Ps оинлч fon. desired ous put 
file. 
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b. Variable and Parameter Thoice 
jJ chino a rj ill ask Ir treatment .1 is in the 
test. If yes, then it will then ask the partition value for 
treatment 1, whether successes lis above or below that 
value. The same questions will be asked for sach treatment. 
After each treatment is either excluded or given a partition 
value, Lochinvar will offer the user the chance to exclude 
blocks frcm the test. 
4. Test Execution 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available £9 the User 
Тһе program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
file. The test statistic T will ba listed. 


C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


A Macao Bleck and Treatment Totals 

Tf the user chooses the method of cell counts 
(paragraph 2, above), Lochinvar will sum the successes over 
each of b blocks and over each of к treatments. These 
Fals will bs in the cells vector of line 3a of the options 
file (see Appendix D for details of tne options file), to be 
read by Crunch. 

If the user chooses to enter partition values 
(paragraph 3 above), Crunch uses that information to compute 
the block and treatment totals. 

b. Computing the Test Statistic 

The numerator of the test statistic involves the 

Sums of successes for each treatment. The total number of 


successes N is found by summing ach treatment number of 





successes Сі: Н divided by the number of treatments k is the 
average number of successes per treatment. Find the sum of 
squared deviations by summing ovez k treatments the square 
of (Ci-N/k). Multiply that sum of squares by k(k-1) to get 
the test statistic numerator. 

The denominator of the test statistic is the sun 
over all b blocks of the number of successes in bleck i 
times the number of failures in block i. 

Divide the numerator by the denominator. 

2. Program Method of Operation 
a. When Using Cell Counts 

In the procedure Cochran, Lochinvar prompts the 
answer of yes or no to the question of whether block 1, 
treatment j, is a success. If yes, it increments the total 
of successes in the count for block 1 and treatment j. The 
total for block i is contained in the ith element of the 
esor CELLS; the total for the jth treatment is contained 
in the (j+b)th element of CELLS, where bis the number of 
blocks. The number of blocks  (NROW) and number of treat- 
ments (NCCL) are written in Line 3 5f the options file; the 
vector CELLS is written in Line 3a. The Crunch subroutine 
Celler reads those values, and sends them to the subroutine 
eschten, which computes the test statistic. 

b. When using Partition Values 

In the procedure Cochran, Lochinvar asks the 
partition value of each treatment included in the test; 
those values are entered in the odd numbered elements of 
vector PART_V, Ша о еее (2 КЕТ: where k is the 
number of treatments. In the even numbered elements of 
PART_V is a +1.0 if successes are below the preceding parti- 
КООП value, ОГОО Т ene Successes are above it. SOE 
example, PART V of 3.0 -1.9 7.0 1.0 means that there are two 
treatments; in the first treatment, successes are above 3.0, 


in the second, Success2S are below 7.0. The number ot 
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blocks  (NROW) and the number of  *reatments (NCOL) are 
Witten in the Line 3 of the options file; PART V is written 
Into line 3a. 

The Crunch subroutine Celler calls the sub-outine 
Coch to use PART V and the data elements values to find the 
totals of successes for the blocks and treatments. ТЕ 
places those counts (in the same format as CELLS, describe 
in subparagraph a, above) into the vector RCCT. That vector 
is used by the subroutine Cochrn to compute the test 
Beatistic. 

3. Program Characteristics 
a. Limitations 

When the user enters cell counts, the sum of the 
treatments and blocks cannot exceei 50 (b*k), and the total 
number of cells (b*k) cannot exceed 400. 

When the program computes the treatment and block 
totals, +he maximum number of data points is 400; the 
maximum number of treatments is 15. The sun of treatments 
and blocks (b+k) cannot exceed 50. 

b. Subroutines/Procedures Used 

(Y Lochinvar: Cochran, cel_test 

(7| crumch: Gochrzmn,celrsr 

(3 External: Shsort 





XXIV. KOLMOGOROV TEST 


A. OVERVIEW 


hos 


іа 


eneral Purpose 

With a sample of 30 or less data points, test to ses 
whether the population from which th2 sample is drawn is of 
a specified distribution with specified parameters. The 
choice of hypothesised distributions is uniforn, normal, 
exponential, weibull, and erlang (gamma with an integer 
shape parameter). 

The Lilliefors test is idantical to the Kolmogorov 
test except that it estinates parameters from the sample; 
normal and exponential distributions can be hypothesised. 

2. Level of Measurement 

Interval 
3. Assumptions 

The sample is a random samples from the vopula*icn. 
4.  Hypothesss 

With РЖҰ as the theoretical distribution with 
specified parameters, and S(x) as the population's urler- 
ang distribution function, the hypotheses are: 

a. Two tailed test 

(1) Null hypothesis hO: F* (x) =S(x) for all x 
(2) Alternative hypothesis hí!: F*(x) #S (x) for 
at least one x in range of sample 

b. On? tailed test 

(1) Null hypothesis hd: F*(x)<S(x) for all x 
(2) Alternative hypothesis h1: F*(x)><(x) 

c. One tailed test 
(1) Null hypothesis hd: F* (x) 2S (x) 

(2) Alternative hypothesis hi: P¥*¥ (x) <S (x) 





B. ACTION REQUIRED EY USER OF PROGRAM 


a Set has more than one variable, the user 
must specify which is used. Т? the chosen variable has more 
than 30 elements, the chi-square goodness of fi* test will 
automatically be given. Otherwise, the user must specify 
the distribution to use, and all its parameters, from among 
the fcllcwing: 

a. Normal (specify mu, sigma squared) 

b. Exponential (specify lambda=rate,scale) 

с. Uniform (specify a=lower bound, b=upper bound) 

d. Erlang (specify n=integer,shape; lambda=scals) 

е. Weibull (specify alpha=shape, lambda=scale) 

2. Program Activation when Using Data Values 
апта СһҺоО1Сёе5 

See section III.C for a detailed explanation of 
Ene fcllcwing. 

In CMS, type LOCHI. If data is already ina data 


mle, enter "Y" to first question, then enter file name; 
when offered a test, type 20. If data needs to be entered, 
enter "N" for first question, I| "uror Second; when data 


entered, choose to do a test; when offered a choice of 
tests, type 20. пег UM unl. Uu" ror desired output 
file. 
b. Variable and Parameter Choice 

When prompted by Lochinvar: 

(1) Enter index of variable to be used in test. 

(2) Enter number of theoretical distribution. 

(3) Enter parameters as prompted. 

(4) Enter letter of type hypothesis. 





Bee lese § xeocu] On 


Tesperzermschenvest „after the above en data 


ct 
r1 
a 
Ó 
th 


and parameters, typ2 CRUNCH while in CMS. 
4. Qutput Available to the Ussr 

The program CRUNCH displays the output file to which 
the data has been sent, =2.9., "U LISTING". The names of the 
data set and variablss used are at the top of the output 
file. The values of the largest difference F*(x)-S(x), the 
smallest (most negative) difference, and the greatest abso- 
lute difference are displayed. Depending on the hypcchesis, 
Crunch selects the proper value of these three as the test 
statistic, and displays that value. 


Co ACTICNS PERFORMED BY THE PROGRA4 CRUNCH 





а. ПЕ Саю аиа Distribution Function (CDF) 
Sort the data elements, giv: ng Order statistics 
X(1) ,...,X(n), where n is the number of data elements. The 
empirical cdf equals 0 for values of X lower than the 
smallest X, X(1). КОИ the caf JumpS co 1/n; “he caf 
increases by 1/n at each new order statistic. At X(n), the 
EN Urical cdf equals 1, and remains 1 for all values of x 
greater than the sample. The empirical cdf is stored in the 
sector S. 
popolne s on the Theoretical CDF 
With a specified distribution and parameters, 
find the value of the cdf for sach value of the order 
Fu r istics X(1) ,... ,/, X(n) . For exanple, with exponential 
(lambda=2) specified and X=1.5, the value of the theoretical 
caf is 1.0-єхр (- (2ж1.5)) =0.9502. tore these values in the 
E cor PF. 





Es Difference between Empirical and Theoretical 

CDF’ S 
With a sample of size n, there are 2n differences 

between S and F. For example, at X(1), S jumps from 0 to 
1/n, so that F(1) is compared to both values. At each order 
Statistic value, the theoretical distribution fF (i) as 
compared to S(i-1) and S(i). Crunch keeps track of the tmin 
(most negativa difference, or if all are positive, the 
smallest positivs), and tmax (most positive, or if all are 
negative, the closest to zero). 

d. Tesz Statistics 

For a two tailed t2st (hypothesis A), the test 
statistic is the larger cf the absolute values of tmax and 
tmin. Fer hypothesis B, tmax is tha test statistic. Pos 
hypothesis C, tmin is the test statistic. 
2. Program Method oi Operation 

The subroutine Kolmog compute the empirical cdf, by 
BO for lcop i1 to n, with S(1)=1/n. Shsort orders the 
sample values to form rder statistics. The theoretical 
Ss are found in one oí the £cllowing subroutines: nornf, 
ENDE, unif, erlf, and weibf. 

The subroutine Pindt finds the differences between S 


and F for each value of X. For i=1, F is compared to 0 and 


EUM . БОЛТА УС sor loop 21-2 to n, F(3) is compared to 
S(i) and S(i-1). These 2n difference are stored in the 
vector T. Another do for loop, j=1 to 2n, finds tmax and 
enin. From those two values, the greatest absolute differ- 


ence is found. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maximum 


nunber of variables/treatments: 15. 





Б. Subroutines/Procedures Used 
Mserochinwvarssskolnoe, Ораг, Wpar, Erpar, Npar, 
Expar : 
ОШООО КОШО, Fiadt; Norms, Unit,  EXpf, 
Belf, Weibf 
(3) External: Shsort 





A. OVERVIEW 


1. General Purpose 
With a sample of less +han 31 elenents, this *ests 
whether the sample came from a specified distribution, 
either a normal or exponential. The parameters used for tha 
test are the maximum likalihood estimates (MLE), derived 
from the sample mean and variance. Those +wo values are 
computed and displayed in the output fils; however, 
Mepliefors tests only th= distribution typ2, not the param- 
eter value. (Апу value other than che MLE would have a 
higher probability of rejecting thea null hypothesis). 
If specific parameters must be tested for, use the 
Kolmogorov test. 
2. Level of Measuremsat 





Interval 
3. Assumptions 
The data values are a randon sample of the popula- 


4. Hypothesis 
With P*(x) dseenosenwEO*thssisedwedistsibutrTonm func- 
tion, with MLE parameters, and S(x) as the actual population 
distribution 
a. Null hypothesis 
HO: F*(x)=S(x) for all x 
b. Alternative hypothesis 
H1: Рж(х) #5 (х) for at least one x 





e ) AGTICN REQUIBRED EY USER OF PROGRAM 


1. User Decisions about Program Options 
If more than one variable is in the data se+, the 
user must specify which one to utilize for the test. Er 
that variable has more than 31 elemants, the goodness of fit 
test will be offered automatically. Otherwise, choose 
whether to test for the normal cr the exponential distribu- 


tion. 


See section III.C for a detailed explanation of 
the follcwing. 

In CMS, type LOCHI. If data is already ina data 
file, enter "Y" to first question, then enter file name; 
when offered a test, tyve 21. If data needs to be entered, 
enter "N" for first question, "ү" for second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 21. Eper "Uu Uyui n. "Z" for desired output 
Eule. 

b. Variable and Parameter Choice 
When prompted by Lochinvar: 
(1) Enter index of variable to be used in test, 
if there is more than one in the data set. 
(2) Enter whether to test against the best 
СЕБЕ: па normal or exponential distribution. 
3. Test Executien 
Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Output Available to the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set and variables used are at the top of the output 
ple. The sample mean and variance are listed, as well as 
the test statistic. 
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C.  ACTICNS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation of Test Statistic 
a. Finding the MLE Parameters 
The sample mean and variance are computed. Te 


mmeetest is for normality, the theoretical distribuition?s 
parameters mu and sigma squared are estimated by the sample 
mean and variance respectively. If the exponential is the 
theoretical distribution, its parameter lambda is the recip- 
rocal of the sample mean. 
be Calculation of the Test Statistic 
The remainder of the test is exactly the same as 
the two- tailed Kolmogorov test. The ampirical distribution 
function is generated using (1/n) increments for each of the 
n sample data elements. The difference between the empir- 
ical and theoretical distribution is computed at each value 
of x contained in the sample. The greatest absolute differ- 
ence is the test statistic. 
2. Program Method of Operation 
The subroutine Lilief calls Muvar to find the sample 
mean and variance, then assigns MLE values to the rp vector. 
The subroutines Norf cr Expf, as appropriate, are called to 
generate values of the thecretical distribution at the 
values of X(1) ,-.. X (n). The subroutine Findt finds the 
largest, smallest and greatest absolute difference; that 
last value is the test statistic. 
3. Program Characteristics 
а. Limitations 
Maximum number of data points: 30, in one vari- 
able. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Lillie 
S unen: Lilier, Wvar,Expr,Norf,rFindt 
(3) External: Shsort 
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As OVERVIEW 


With a sample of 30 or less data points, test *o see 
if «ne sample is froma normal distribution of unspecified 
parameters. Although Crunch computes the sample mean and 
variance, they are not displayed. No statenent about the 
population parameters is made by this test. 


2. Level of Measurement 





Interval 
3. Assumptions 
The sample is randomly drawn from the population. 
4. Hypothesis 
NAPA) asa normal distribution of unspecified 
parameters, and S(x) as the distcCibDution fünction of the 
population, 
a. Null hypothesis 
HO: P*(x) =S(x) for all x 
b. Alternative hypothesis 


H1: F*(x) #S(x) for at laast one x 


В. ACTION REQUIRED BY USER OF PROGRAM 


If there is more than one variable in the test, the 
user must specify which one is to be utilized. тас 
variable has nore than 30 elements, the goodness of fit test 
will be automatically offsred. me St, nó further choices 
are necessary. 
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ses Sect л ТГ С for a dstailed expianation of 
Eus fcllcwing. 

ПШ ОЛЫ Туре LOCHI I£ data iS already ia a data 
file, enter "Y" to first question, then enter file name; 
when offered a test, type 22. If data needs to be entered, 
enter "N" for first question, "ny" for second; when data 
entered, choose to do a test; when offered a choice of 
ests, type 22. ieee "y". .."7'"t ror desired output 
muro. 

b. Variable and Parameter Choice 
When prompted by Lochinvar: 
Enter index of variable to be used in test, if 
there is more than one in the data set. 
а. Test Execution 
Tc perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
5. Qutput Available t9 the User 
The program CRUNCH displays the output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
data set andvariables used are at the top of the output 


mere. The test statistic will also ba included. 


C ACTIONS PERFORMED BY THE PROGRAM CRUNCH 





a. Compute the Sum of Squared Deviations 
The sum of squared deviations is the sum over n 
of the square of the difference between X(i) and the mean of 
X. The program actually figures the sampl2 variance and 
mu iplies by (n-1). This is the denominator of the test 


Epp" St dic. 
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b. Compute the Sum of the Weighted Differences 


uocum eM = БОЕ Ва ео the order statistics, 


ОЛ) ,..., XE). Subtract values with the same depth from 
the end pcoints,=.9, Subtract X(n) - X(1), СЕТ) - 
MN... px (n-1+1) - X(i). The n is even, there are n/2 


differences: if n is odd, there are  (n-1)/2 differences, 
with the median excluded. Multiply these differences by 
a(i) weights, provided by Lochinvar. Sum the weighted 
Beererences. This is the numerator of the test staistic. 
Cc. Compute the Test Statistic 
Divide the sum of weighted difference by the sun 
of squared deviations. 
2. Program Method of Operation 
The subroutine finds the sample variance by calling 
Muvar. Shsort is used to order the sample into order 
EN eics X1)... —,X(n). Ado for Losp, 1=1 to *runc(n/2), 
is used to calculate the sum of a(i)* (X (n-1+1)-x(1)); the 
a(i)'s are contained in the rp vector. 
3. Program Characteristics 
a. Linitations 
Maximum number of data points: 400. Maximum 
number of variables/treatments: 15. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Wilkes 
(2) Crunch: Wilksh,Muvar 
(3 External: Shsort 
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XXVII. SMIRNOV TEST 


A. OVERVIEW 


1. General Purpose 

With two samples, X and í; each drawn from a 
different population, test whether the underlying population 
distributions F(x) and G(y) could be equal over the entire 
range of X and Y. 

The Smirnov and Cramer-von Mises test differ only in 
the computation of tha tes statistic, and therefore, the 
choice of hypotheses. 

2. Level of Measurement 

Ordinal 

3. Assumptions 

The samples arə drawn ranjomly from their popula- 
tions, and ar2 rankable against each other as well as within 
samoles. 

4. Hypotheses 
a. Two tailed test 
(1) Nuil hypothesis hd: F(x)=q(x) for all x 
(2) Alternative hypothasis hi: F(x)*g(x) for at 
leas* one x in range of sample 
b. Ones tailed test 
cgi nwb»po-hesss h9:- F(xpSaí(x) for ail x 
(2) Alternative hypothesis hil: F (x)»q(x) 
с. Опе tailed test 
(1) Null hypothesis hd: F(x) 29g (x) 
(2) Alternative hypothesis hi: F (x) <q (x) 
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В. ACTICN REQUIRED EY USER OF PROGRAM 


9 
t1 
(D 


r must designate which of two variables 


rj 
ЕП 
(D 


2 
used, if the data set contains more than two variables. 
user specifies the type of hypothesis. 
2. Program Activation when Using Data Values 
а. Initial Choices 

See secticn III.C for a detailed explanation of 
the following. 

In CMS, type LOCHI. If data is already in a data 
mee, enter "Y" to first question, then enter file name; 
when cffered a test, type 23. I£ data needs to be entered, 
enter "N" for first question, үш tor second; when data 
entered, choose to do a test; when offered a choice of 
tests, type 23. Ao un iw. oe for desired output 
me. 

b. Variable and Parameter Choice 
When prompted by Lochinvar: Enter indices of 
variables to be used in test. 
3. Test Executior 
To perform the test, after the above entry of data 
and parameters, type CRUNCH while in CMS. 
4. Qutput Available to the User 
The program CRUNCH displays tne output file to which 
the data has been sent, e.g., "U LISTING". The names of the 
đata set and variables used are at the top of the output 
ШЕШЕ. The test statistic will also be listed. 
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eee ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Cal 
a. Calculate the D 


X and Y are sorted 


¡como der. "Statistics, 
NU ...,X(m), and Y(1),...,Y (n). She counter for X IS 1, 
the counter for Y is j. Compare X(1) with Y(1). If X(1) < 
Y(1) then increment i; if X(1) > Y(1) then increment j. The 
difference between the empirical distribution functions is 
(i-1)/m minus (j-1)/n. After each comparison of X(i) and 
Y(j) the appropriate counter is incremented and the differ- 
ence is calculated and stored in the vector S. The counter 
i is incremented until it reaches (m*1); j can reach (n+1). 
If X(i) = Y(j), increment i, calculate and store the differ- 
ence, then increment j, and compute and store the differ- 
ence. There will be (mtn) elements in S. 
b. Calculate the Test Statistic 
For a two tailed test (hypothesis А), the 
greatest absolute difference is the test statistic. For 
K Othesis B, the largest difference is the test statistic. 
For hypothesis C, the smallest difference is the test 
eeetistic. 
2. Program Method of Operation 
Tnessubrou’rine Smirn calls Shsort to rank X, then 
rank Y. It calls the subroutine Finds to calculate the 
difference vector S, then chooses the test statistic based 
on the hypothesis specified. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maximun 
number of variables: 2. 
b. Subroutines/Procedures Used 
(1) Lochinvar: Smirnov 
(2) Crunch: Smirn, Finds 
(3) External: Shsort 
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А. | OVERVIEW 


1. General Purpose 
With two samples, X and Y, test whether the under- 
lying population distributions F(x) and G(y) could be equal 
over the entire range of X and Y. 
The Smirnov and Cramer-von Mises test differ only in 
Ae Computation of the test statistic, and therefore, the 
choice of hypotheses. 


= 
2. Level of Measurement 





Interval 
3. Assumptions 
The samples are randomly drawn from the respective 
populations, and are independent of sach other. 
4. Hypothesis 
a. Null hypothesis 
HO: F(X) =G(X) for all x 
b. Alternative hypothesis 
H1: F(x) =G(x) for at least ons x 


В. ACTION REQUIRED BY USER OF PROGRAM 


е more than tw> variablas in +the data 
set, specify which two are in the ta 
V 


2. Program Act} 


ШИС e OC LOCHI. I£ lata is 


e”, enter "Y" to first question, then enter file name: 
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when cffered a test, type 28. If data needs to be entered, 
enter "N" for first question, MU co Second: when data 
entered, choose to do a test; when offered a choice of 
tasts, type 28. A A тог desired output 
ple. j 
b. Variable and Parameter Choice 
When prompted by Lochinvar: 
Enter indices of variables to be used in test., 
if the data set has mere than two. 
3. Tesi Execution 
To perform the test, after the above entry of data 
and parameters, *ype CRUNCH while in CMS. 
4. Output Available io the Usar 
The program CRUNCH displays th2 output file to which 
the data has been sent, e.g., "JU LISTING". The names cf the 
data set and variables used are at the top of the output 


wn In addition, the tast statistic is listed. 


C. ACTIONS PERFORMED BY THE PROGRAM CRUNCH 


1. Description of Calculation of Test Statistic 
a. Calculate the Difference Vactor S 
This step is exactly the same (using the same 
subroutine) as in the Smirnov test. See section XXVII for 
details. 


eccal culating the Test Statistic 
Find the sum of the squared elements of the S 
vector. With m as the number of elements of X, and nas the 
number of elements of Y, multiply the sum of squared S(i) 
times (m*n)/(a*n)**2. This is the test statistic. 


2. Program jetacd cf Operarior 


Thesubroutine Crmvon calls Shsor+ to order X and 
Seer Y into order statistics. суо cuoco is Finds finds 
the vector 5. Ыз ЛО ОШ СС, == ro (men),  Crmvon 
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squares and suns 5 (і). That sum is then multiplied by 
(m*n) / (m+n) **2. 
3. Program Characteristics 
a. Limitations 
Maximum number of data points: 400. Maximum 
number of variables: 2. 
b. Subroutines/Procedures Used 
{1) Lochinvar: Cramvon 
(joe Caine n.  czuvon, Finds 
(External; Si Soft 
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APPENDIX A 
TUTORIAL: LINKING TO AND USING THE COMPUTER PACKAGE 


eee PURPOSE 


This appendix contains three things: instructions on 
Howe to link to the class disk to perform nonparametric 
tests; ЕСС Шоп ons how toc copy the programs onto a 
user's A disk; and an example of how to use the program. 


N 


B. LINKING TO THE CLASS DISK 


To perform the test, a student can link to user number 
2266p, read only password Ə BRIEN, and use the programs there 
to perform nonparametric tests. The steps are: 

1. Linking to the Class Disk 
In CMS, type "CP LINK 2266P 191 192 RR". The 
computer will type back ENTER READ ONLY PASSWORD. Enter 
"OBRIEN". After receiving back the "R" message, enter ACC 
192 B/AÀ. 
2. Using the Programs 
тот 


An example o he use of Lochinvar and Crunch is 
given belcw. Detailed instructions for use are given in 
section III and the sections on specific tests. 

3. Releasing the Class Disk 


After you are done, type RELEASE 192. 


C. COPYING THE PACKAGE 


Three files are needed to run nonparametric tests: 
EET MODULE", “CRUNCH TEXT", and "CRUNCH EXEC". L oe; sana 
two are compilations of *he source code, contained in "LOCHI 
PaGaL" and "CRUNCH —FORTRAN", JOE OO po toy ot thsss 


programs: 
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I Linkweo the Class Disk 

2. Сору the File: Туре XEDIT «filename» «filetype». Type 
SAVE = = A in the space marked ==> at the bottom of the 
screen, then FILE. The file will be on tne A disk. 

3. Release the Class Disk 


D. AN EXAMPLE OF TEST USE 


To fully understand this example, read or review 
eet ons III and XXI of this text. The data set used is 
from Conover [ Ref. 1: p.319]; the tes- used is the Friedman 
test. The example shows the program's prompting ín capital 
letters, and the user's response in lower case, with a <== 
Ae he right. 

1. Entering the Dat 


a File 
a. Preliminary Steps 

There are four initial steps to perform before 
entering a data file. fie firStaoe tO activiate Lochinvar, 
then to request not to use a previously entered file, then 
to enter the file by data values, and finally to write the 


ИШ ОГаасїсп to data file "lochi a't. 


Schi == (1) 
EXECUTION BEGINS... 
DO YOU WANT TO PERFORM A TEST ON A PREVIOUSLY ENTERED LATA SET? 
ENTER Y OR N 
n <== (2) 
YOU HAVE A CHOICE OF TWO METHODS TO ENTER DATA: 
1. ENTER EACH VALUE OF DATA SET. VALUES MUST BE NUMERICAL, AND 
MEASUREMENT AT LEAST ORDINAL. ALL TESTS MAY BE PERFORMED THIS W 
2. ENTER ONLY THE COUNT OF SUCCESSES/FAILURES OR ROW/COLUMN CELL 
COUNT. LATA VALUES AND LEVEL OF MEASUREMENT MAY BE NOMINAL, BUT 
COUNTS MUST BE NON-NEGATIVE INTEGERS. THE FOLLOWING TESTS MAY BE 
PERFORMED: BINOMIAL, MCNEMAR, R*C CONTINGENCY, CHI-SQUARE 
GOODNESS ОЕ ВІТ. 
ENTER V FOR VALUE OR C FOR COUNT 
у <== (3) 
ENTER (A-F) TO WHICH DATA GOES 
a == 
b. Enter Information cf Label of Data Set 
The label needs three types of information: the 
data set name, the number of variables, and the variable 
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names. For this example, we are using the data of exercize 
1, page 309 in Conover, with four variables, named the names 
of the seasons: 
ENTER NAME OF DATA SET | , 

prob 1,p 309, number of births by hospital <== (1) 


a (2) OF VARIABLES OR TREATMENTS 


ENTER NAME OF _ VARIAELE NR 1 
Winter <= 3a 
ENTER NAME OF va IABLE NR 2 
en L 
ENTER NAME OF VARIAELE NR 3 
4 


summer <== MER 
ENTER NAME OF VARIABIE NR 
fall < 


C. Enter Data Elements 


Next, enter the data when prompted. Enter either inte- 
aS Or real numbers, i.¢., with or without decimal points; 
the program will read them as reals. Leave 2 space (not a 
comma) in between numbers. Enter all elements of a variable 
on one line if possible; if not, there will be a chance to 
add more elements after the "ENTER" key is pressed. There 
will also be a chance oS chsckeor corcect entries. In the 
following example, а mistake is made, then corrected, in 
entry of variabie one. In variable two the data is checked 
and found correct; in variable three it is not checked. In 
variable four, the first four data elements are listed, the 
"ENTER" key is pressed, the last three data elements are 
added, and the data checked. 


(1) Enter, Check, and Correct the Values of 


Variable One: 


WHEN ENTERING DATA ELEMENTS <1, START WITH 0. 
ENTER VALUES FOR VARIABLE 1 
22 9 98 19 21 58 43 <== 
ADD MORE ELEMS TO SAME VARIABLE? 
OR CHECK THIS VAR? OR GO CO NEXT? 
EE ADD), C (CHECK) O OR S(GO ON) 
C == 
NUMBER OF ELEMS ENTERED 7 
NUMBER OF ELEMS IN THIS VAR 7 
92.000 9.000 98.000 19.000 21.000 58.000 43.000 
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(The last entry should have been '42' vice '43', 
so re-enter variable:) 


WANT TO RE-ENTER THIS VARIABLE? 
B t CR N 
ENTER VALUES FOR VARIABLE 1 

92 9 $8 19 21 58 42 
ADD MORE ELEMS TO SAME VARIABLE? 
OR CHECK THIS VAR? OR GO TO NEXT? 
A CRECK)OR G(GO ON) 

С = == 
NUMBER OF ELEMS ENTERED 7 
NUMBER OF ELEMS IN THIS VAR 7 
92.000 9.000 98.000 19.000 21.900 58.000 42.000 
WANT TO RE-ENTER THIS VARIABLE? 
ENTER YORN 

n == 

(2) Enter, Check Values for Second Variable: 

ENTER VALUES FOR VARIABLE 2 

112 11 109 26 22 71 49 == 
ADD MORE ELEMS TO SAME VARIABLE? 
OR CHECK THIS VAR? OR GO TO NEXT? 
ENTER A (ADD) , C (CHECK) OR S(GO ON) 

Є = 
NUMBER OF ELEMS ENTERED 14 
EMBER OF ELEMS IN THIS VAR 7 
112.000 11.000 109.000 26.000 22.000 71.000 49.000 


mnis tine, it is right thea rs tine, 
Econo re entrys) 


T TO RE-ENTER THIS VARIABLE? 
ER I OR N 

[== 
(3) Enter Values for Third Variable: 
ALUES FOR VARIABLE 3 
S2 19 23 51 44 <= 
EMS TO SAME NOS 


L 
н VAR? OR GO 


IABLE? 
O NEXT? 
),C(CHECK)OR G(GO ON) 


(4) Enter, Add to, Check Values for Fourth 


ENTER VALUES FOR VARIABLE 4 
12 €1 18 
ADD MORE ELEMS TO SAME VARIABLE? 
oR CHECK THIS VAR? OR GO TO NEXT? 
ENTER à (ADD), C(CHZCK)OR G(GO ON) 

a — к= 
ENTER VALUES FOR VARIABLE 4 

24 62 41 <== 

ADD MORE ELEMS TO SAME YARIABLE? 
OR CHECK THIS VAR? OR GO TO NEXT? 
ENTER A (ADD), C(CHECK)OR G (GO ON) 

© == 
NUMBER OF ELEMS ENTERED 28 
NUMBER OF ELEMS IN THIS VAR 7 
77.000 12.000 81.000 ‘18. 900, 24.000 62.000 41.000 
WANT TO RE-ENTER THIS VARIABLE 
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ENTER Y OR N 
n с== 


IL OS iaa oOfters a chance to re-enter the 
entire data set 
YOU HAVE ENTERED 4&4 VARIABLES 
WITH THE FOLLOWING NUMBER OF ELEMENTS IN EACH 
VAR NR NR OF ЕЕЕ 
2 7 
a 7 
DO YOU WISH TO RE-ENTER THE ENTIRE DATA? 
ENTER Y OR N 
n < = = 
(6) Lochinvar Determines the Variables are of 


equal size: 
ШИНЕ VARTABLES/TREATMENTS ARE OF EQUAL SIZE: 
BATRED CCMPARISON TESTS ARE POSSIBLE 

(7) Choice to do a Test witn this DATA 
„Р А = SW IAS DATA SET? 


ENT Y OR 
y <== 


The relevant choices are Jd) LIS 
meee ONG’ ,...,'Z LISTING’. Ao cenas) 15 al:isady in the 
file will be written over. 

АНЕ EORBOUTPUTFFILE,W=-2? 


r Cho lce GE RTOS= 
Lochinvar will offer “est according to the number 
of variables in the data set and whether the variables are 
of equal length. Test number 17 is desired; the Friedman 
rather than the Durbin is chosen. 


MECK ANONG THE FOLLOWING NON-PARAMETRIC TESTS: 


Ii ZO 
ih 
H 
„> 
p 


TEST FOR TREND <2 VARS,PAIR> 


600. з о 9? @ @ 
i 
H ^H dst 
Be Denen 
Ex] (13 Ex] i 73 G2 m9 NV Ex] 
gies E "d 


Arun. 
eA us WHO AHE AO wW 


OO-MONZZSOcsH 
t463 tw 

rJ LO A A A 
O a 

Riaza "d 

td MAHAL cá 


— — — O 02D ЈОЛ шәм) 
DH HH 


NO 

ZAZZOO Uum 
tI ROBO 

по рО р ИА =(nr3: 
b= t> C) O (n 2¿ WD i eH 
DUO PAGA 
ps tp. CÓ 

Hir4r3teijHO V 
AAMC 





13. MEDIAN 
13. KRUSKAL/WALLIS 
15.SOUARED RANK TEST FOR ET VARIANCE 
16.HARTLEY TEST FOR EQUAL VARIANCE 
17.FREIDMAN OR DURBIN <BLOCKED> 
18.QUADE <BLOCKED> 
19.COCHRAN <BLOCKED> 
20. KOLMOGOROV(ONE SAMPLE:PARAMETERS SPECIFIED 
ERS LILLIEFORS on SAMPLE;NORMAL OR EXPONENTIAL) 
22.WILKES-SHAPIRO TEST FOR NORMALITY 
23. SMIRNOV {2- SAMPLE) 
20. CRAMER-VON MISES [e ALPE) 
ШООУОПО WANT TO SEE THE LIST AGAIN? 
RR 1 OR N 

n = = 
ШТЕР e NUMBER OF “A TEST LISTED ABOVE 
mats WILL ALLOW THE FRIEDMAN OR THE DURBIN TEST 
<DURBIN IS INCOMPETE BLOCK DESIGN: NEEDS FILLER» 

DO YOU WANT TO PERFORM THE FRIEDMAN TEST? 

ENTER Y OR N 
y == 
c. Choice of Test Paraneters 
For this example, only the first, second and 


fourth variables are used in the tast, and the third block 
of the data set (i.e., the third element of each variable) 
is excluded from the test. The final test parameter entered 


is the level of significance. 


(1) Variables Included in Test 
PARAMETERS <1, START WITH 0. 

LL 4 INCLUDED IN TESTI 

ARIABLES ARE INCLUDED IN TEST? 


Y 
NDICES OF VARIABLES ТО ВЕ IN TEST 
INDEX OF INCLUDED VARIABLE 1 


INDEX OF INCLUDED VARIABLE 2 
¿INDEX OF INCLUDED VARIABLE 3 


4 


T 
en інн ин z 


(2) EVelude Third Block 
OU Y OR N TO EXCLUDE ANY BLOCKS FROM TEST 


ENG ORDER, ENTER IND 
EX FOR EXCLUDED BLOC 


Су 

LJ 

= 
UU N 


ICES Ll > E 
K NR 1 
pecify Level of Significance 


S 
ОР SIGNIFICANCE, ALPHA 
i SPART WITH J. 
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8. 


Run Test with Program Crunch 


їп СМ$, И CRUNCH 


s О ccaput the test statistic,then display: 


THE RESULIS ARZ LISTED IN FILE 
U LISTING 


To check the results, type XEDIT Ü LISTING 
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APPENDIX B 
THE PROGRAM LOCHINVAR 


A. Purpose 


This appendix is intended to assist in modifying +he 
program Lochinvar, or in implementing it ona computer 
system other than the IBM-3033 at the Naval Postgraduate 
School. The last part of this appendix, general description 
of program operation, may be of interest to 2 user who is 
experienced in the use of Lcchinvar. 

Section III of the text describes the general use of the 
progran, while Sections V through XXVIII give specific 
instructions on responding to Lochinvar's promptings for the 
various tests. Appendices C and D explain in detail 
ExDLnvar's two output files, the data file and the options 
file, respectively. 

The source code is contained in Appendix H. 


B. DESCRIPTION OF PROGRAM CHARACTERISTICS 


1. Languag 
Lochinvar was originally written and tested in 


KD 


Waterloo Pascal; it is presently in a form compatible with 
the IBM virtual system Pascal compiler, PASCALVS. The 
program's syntax is consistent with the Pascal described by 
Zaks (Ref. 3]. 

2. Input and Qutrut Files 





In Lochirvar, input or output files are referred to 


by One of nine text definitions. "Qin" refers to terminal 
But; "gout" refers to terminal input; "op" refers to the 
n ons file, "Iochi ор". Beta through £f refer to 


s files "lochi a" through "lochi f"; these data files are 
used both as input and output. 
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ИО USangea file Lor output, Lochinvar must 
cpen that file by the "rewrite" command. For the data 
files, ПОКОЕ СУБИ he procedus Newldata; for hs 
Options file, it occurs in the procedure File_io. All seven 
Be those files are explicitly defined as fixed format, 
Mecucal record length 80. The form of the rewrite command, 
Ene illustrated for "lochi a", is: 

REWRITE (A,' NAME = LICHI.A.A,RECPM=F,LRECL=80') 

The terminal output is opened in the main progran, 
using the command "TERMOUT(QOUT) ". 

Prior to reading a file as input, Lochinvar must 
open that file for reading with the "reset" command. Bor 
Na tiles, this occurs in the procedure Check file. 
The terminal input file must be reset prior to each time 
data is entered from the keyboard. Rather than having reset 
commands scattered throughout the program, the procedure Rst 
is invoked. Rst resets the terminal for input with the 
command "TERMIN (QIN) ". 

3. Dimensions of Data Structures 

At the beginning of Lochinvar are six constants. 
Tot_data defines the total number of data elements contained 
in the data set; it is currently set at 400. Tot vars is 
the maximum number of variables in a data se+ plus one; it 
is currently set at 16 (for 15 variables). L names is the 
length of variable or data set names; it is currently set at 
50. Mba Ll 1 par, and L_e_b are the number of real 
parameters, the number of integer parameters, and number of 
excluded blocks, respectively; each is currently set at 50. 

Changing the setting of any one of those numbers 
will change all subsequent dimensioning within the progran. 
It is not recommended that L_names Je increased beyond 80, 
since this would place names on moz? than one line (card 


image) of the data or options file. 
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4. Type and Variable Declarations 
ch nstant declarations, the type declations 
define arrays, sets and types. Below that are variable 
declarations. All parameters are defined globally. 
о 


m 
Variables, Arrays, Sets 


5. Description 
Below the variable declarations, a comment section 
gives brief descriptions of each variable, set or array 
passed as a parameter. 
6. Description of Procedures and Functions 


Each subprogram element contains a brief comment 


section describing its purpose. 


C. DESCRIPTION OF PROGRAM OPERATION 


The main program offers the user three choices: use a 
previously enter data set to perform a test, enter a new 
qata set, or entsr cell counts. See Section III of the text 
for a complete explanation of these choices. Depending on 
the user's choice, the main program will call one of three 
master subprograms tc prompt the user through the data and 
parameter entries: 
1. New Data 
The master  subprogram New Data prompts entry ofa 
new data set and offers the user the chance to perform a 
test with the newly entered data. Steps in this process: 
a. Data Set File Type 
New data asks the filatype for the data, "а" 
rough "£8", 
b. Names and Number of Variables 
New_data calls Namer, which asks the data set 
name, the number of variables, and the variable names. 
C. Data Element Values 
If there are two variables, New da-a offers +he 


Eco of entering data pair by pair. If the user chooses to 
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do this, New_data calls E_pairs. Otherwise, tha procedure 
E_by_var grompts əntry of data variabl2 by variable. 
do WE Cingr of Data File 
New_data writes the data ser file. 
€. Test Choice 
New data asks if a test should be performed with 
this data. If yes, File,io and Which test are called; these 
procedures are described in the next paragraph. 
2. Qld data 
If the user indicates that 3 previously entered data 
set is to be used, the procedure Old_data is called. m 
performs the following: 
a. Data File Choice 
Old data asks the user's choice of file type of 
old data, icc EJ Uu chrough "lochi  £'. Gea data “calls 
Chech file, which read and displays the name of the indi- 
cated data set. ТЕШ Ос ichezrestreet data set, it also 
reads the number of variables and number of elements per 
variable. 
b. Output File Choice 
The procedure Pile io asks to which output file 
the test results shculd be sent. It writes the number of 
the data set file (NIF) and the number of the «est results 
file (NOF) to line one of the options file. 
c. Test Selection and Parameter Choice 
Old_data calls the procedure Which_test. This 
procedure offers some or all of th2 nonparametric tests, 
depending on the number of variables in the data set and 
whether tte number of elements per variable are equal. Once 
the choice is made, Which, test writes that choice to line 2 
Of the options file, along with whether the test involves 
Sor. counts. Which_test then calls the procedure for the 
chosen test. 
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d. Test Variable and Parameter Selection 
Fach test procedure is somewhat different, 
however, there are common points. If the test involves 
cells, the test procedure prompts cell related parameters, 
then calls NW cells to write those parameters to line 3 of 
the options file (see Appendix D for details of that file). 
If the data set contains more variables than the minimum 
needed for the test, Which vec will prompt the user to enter 
which variables are included in the test. If the test 
involves paired or blocked data, the procedure Exclud_ 5 
offers the user the chance to exclude pairs or blocks from 
the test; in other cases, None ex writes а "0" to line 5 of 
the options file. The test procedure then prompts entry of 
various integer and real parameters, and calls the procedure 
W param to write those parameters to lines 6 and 7 of the 
options file. 
3. Се1_©з51 
If the user specified to enter cell counts, the 
procedure Cel test is called by the main progran. It calls 
File io to write the NIF and NOF to the data file. It calls 
Namer to get the data set and variable names. The choice 
test dictates which procedure prompts the cell count entry. 
Cel test writes the data set name and variable names to line 


BEST the options file. 
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APPENDIX С 
THE DATA FILE 


ES PURPOSE 


This appendix explains the structure of the data files, 
which contain the data set and variable names, che number of 
variables and number of elements per variable, and the data 
elements of the variables. This will aid the user in under- 
standing the computer package. More importantly, 1th wu I 
allow the user to test data generated by another program, or 
read from cards or tape, without having to load the data 


into Lochinvar by hand. 


BEN METHOD OF WRITING DATA FILES 


Cro mts of Data Fil 


LV 


comp222: 


All names are written in character fields of length 


fib 
ч {з 


=s 





E from the first to the forty-eighth position on a line. 


Meee integers are written in a 1615 format,» i.e., With the 
least significant digit rof tha integer on DOS Zon 
ЕИО... 80. All data elements are read as real numbers; 


they are writcen to be read in an 8F10.3 format. This means 
that there are eight fields for real numbers per line, 
1-10, 11-20,...,71-80, and the real numbers have chree digits 
IN ne right ЭЁ the decimal point, e.g., 789.123.  Lochinvar 
includes a decimal pcint in all real numbers; if this is not 
possible, the least Significant digit of an entry (the 0.001 
ASE) must ba right justified in the field, i.s., aligned 
em column 10,20 ,.. . „80. 

TF the user has a data set whose data elements are 
all integers, there is a way to have the data file written 
Without converting all the integers to real numbers. The 


est significant digit of the integers can be aligned on 
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AS a sand ehe Program Crunch wiil read then 
as real numbers with the same value as original integer 
data. 

If the data elements of the data set are either all 
very large or all very small, they may not fit into the 
F10.3 format. A quick fix to that problem is to rescale the 
Values, e.g., multiply very small numbers by 1000, or divide 
large numbers by that amount. Ths validity of the statis- 
tics will not be effected by that transformation. 

If there are both very large and very small numbers 
in the data set, there will be numerical errors in single 
precision Fortran for any tests which use an interval level 
cf measurement. For tests which uses ordinal or nominal 
level of measurement, any monotonic transformation, such as 
y=1n x, could be used +o allow both very large and very 
small data values to fit into the F10.3 format. 

2. Layout of the Data Fil? 
a. Data Sst Name 
The data set name occupies the first line of the 
data file. The name is from 1 to 43 characters long. on 
the same line as the name, inan fortyeninthspros:ıtron, 
there is the character "|", which on IBM keyboards is the 


upper case "1". For example, a data set file name: 
This is a data set name | 


b. Number of Variables and Elements pez Variable 

On the line following the data set name, there is 
a line of integers. The first integer is the number of 
variables in the data set. The second integer is the number 
of elements in the first variable; the third integer is the 
number of elements in the second variable, and so on. There 
must be non-zero integers in the first two positions. 
Locninvar writes zero where there are no values, but that is 
not necessary. AS mentioned above, integers are right 
.u 3 C led on position 5,10,...,80. 
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Let theldata sat consist SE two variables. The 
first variables has two data elements, the second has four. 
the first two lines would appear as: 

This is a data set namne | 
2 2 4 0 0 0 0 Оше» 
с. Data elenents 

The data elements are listed variable by variable, 
i.e., all the elements of the first variable are listed, 
then all cf the second, et cetaera. As mentioned above, the 
data elements are read as real numbers in the F10.3 format. 

Let the elements of the first variable be 27.5 and 
31; let the elements of the sacond variable be 13, 17, 25 


and 35. The first three lines of the data file would appear 


as! 
This is a data set nane 
2 2 4 0 0 0 0 Q 0 0 ОЗ 
27.500 31.000 13. 000 17.000 25.000 3520900 
The zeros to the right of the decimal point are 
written by Lochinvar but are not necessary. The number 


could have been written without decimal points, with proper 
alignment, as shown below. However, when numbers are 
written with decimal points, a mistake in alignment is no 
problem; without decimal points, alignment is crucial. 

Using only alignment for data elements: 

This is a data set name | 
275 K буз S 47 р ET 
d. Variable names 

The variable names begin on the line directly 
follcwing the last data element line. Each variable nane 
occupies a line; there must be the same number or lines for 
variable names as there are number of variables indicated in 


line two. 
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Let freshman be the name of the first variable and 


senior be the name of the second. 


This is a data set na 


= 


a | 
2 2 4 0 0 0 Q Q Q 0 O... 
27.500 31.000 13.000 17.000 292000 35.000 
freshman 
senior 


ШОО LOCATION ON THE USER'S A DISK 


1. Default Location 

As mentioned repeatedly throughout the text, by default 
mae data files are named "lochi a",...'"lochi f". Lochinvar 
writes files to those six locations; Lochinvar will not be 
easily modified unless the user is experience in programming 


Pascal. 


ct 


2. Reading Files from oth? 


In 


than Default Locations 
ne рога Crunch reads '"loch* a",...,"lochi f" based 

en the file definitions in the file "CRUNCH EXEC" and the 
number of input file (NIF) of the options file. "CRUNCH 
NEC" is explained in Appendix E, while the input fila is 
explained ín Appendix D. TO have Cruüúnch read from a prop- 
erly written data file named anything other than "lochi 
EE .,'"lochi f", the user will have to do the following: 

а. Write an options file for the data (see section 
III and Appendix D), either manually (not recommended), or 
else using Lochinvar, specifying a data set with the same 
general claracteristics as the test data set. For example, 
if the test data set is named QUEUE DATA, with four vari- 
ables of equal length, tell Lochinvar to run the desired 
test on data set "lochi a", given that "lochi a" has four 
variables of equal lenghth 

b. Since the number of input file written in the 
Seeeons file is "1", corresponding to the default location 
Boch a", change the file definition in the file "CRUNCH 
mee. The origin file "CRUNCH EXEC", 
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&TRACE 

&ERROR БЕХ 

FILEDEF 01 

LOAD CRUNCH (START 
beccmes, 

ETRAC 

EERROR EEXIT 99 

FILED 

LOAD CRUNCH (START 


C. 


It would 


rename the current 


nane 


"QUEUE DATA" 


Alternate 


PINO 
DISK LOCHI A 


F 01 DISK QUEUE DATA 


Method 
be easier, in the example above, 
file "lochi a" to some other name, 


ОЛ Оеп а" 
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APPENDIX D 
ТИЕЛОРТТОЧ5 FILE 


A. PURPOSE 


This appendix describes the options file. The twenty 
monparametric tests which require ordinal or interval data 
level of measurement all have options files with similar 
Format. The four tests which can handle nominal data have 
БЕЧО Ету idiocyncratic formats, which contain all the infor- 
Nation of other test plus something additional. 

This appendix is included for completeness of docunenta- 
mon of this text. It is not recommended that a user 
by-pass Lochinvar to write the options file by hand. If the 
computer package is implemented on an operating system which 
cannot run Pascal, though, that may b2 necessary. 


B. GENERAL INFORMATICN ABOUT THE OPTIONS FILE 


1. Name 
Lochinvar writes the options file to "lochi op" on the 
user's A disk. 
2. General Format Rules 
As in the data file, all integars are written in the 
format 1615, all real numbers are written as 8F10.3, all 
names are in 48 character lines. S22 Appendix C, paragraph 
В. , for all the different ways to have real numbers repre- 
sented for the program Crunch to read. 
3. Explanation of the Options File Forma* 
a. List of Line Numbers 
There are 14 line numbers: 1, 2, 3, 3a, 4, 4a, 5, 
Saye 0, 6a, 7, fa, 8, 8а. 
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b. Line Numbers in the Option File 

Each of the 14 line 
in cehe орао 5 
not a blank 


rather, the previous line is followed by the next. 


numbers occupies at least cne 


line (or card image) file, “Sunless that Line 


is omitted. There is Card for oni ted line 


numbers; 
and 3a are ommited, line 85 


For example, if line numbers 3 


immediately follcws line 2. 
The lines with "a" in their number may occupy more 
in an actual file, because 


than one line (or card image) 


they may contain more information than would fit on a single 
80 column card. 

CRU eS fOr Inclusion of Linss in Options File 
Lines 1 and 2 are included in every opticns file. 


Lines 3 and 3a are omit*zd unless the test is one 


four which handle nominal data. 
will be 


except when the user has entered cell counts (in one of the 


+he 


of 


Lines 4 and Ha included in every test 


four tests which use nominal data). Line 4 is a single 


positive integer value which tells how many values -o read 


in linse ца. 


Line 5 equals "0" and line 5a is omitted unless 
the user has excluded blocks of the data set from being 
tested. In that case, line 5 indicates the number of 
elements to be read in line 5a. 

Lines 6 and 7 are included in ever test, and 
will contain single integer values. If a value of "0" is in 


one of those lines, 
omitted. 
will be omitted. 
"0 = 


the number of 


E nct 


line. 


For example, if 
If the single value in lines 
Zt must 
elements to be read in 


For example, 


“then the corresponding "a" line will be 


"Q', then line 6a 
4, 9, oe 


representing 


line 6 has a 
апа 
be a positive integer, 
те соггесропа: па "a" 


петната value in line 5 1s "3", 


there will be three elements to real in line 5а. 
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The "a" lines may occupy as many actual lines in 
the data file to contain all the values to be read. 

Lines 8 and 8a are used only when the user has 
entered cell counts for a test involving nominal data. 
d. Content of the Lines 

(1) Line 1: the number of input file (NIF), and 
number of output file (NOF), both integers 

(2) Line 2: che number of the requested test 
(NTEST), and whether the test involves cell counts (CELLS), 
both integers,  CELLSz1 if the test is one of the four which 
can use nominal data. 

(3jykUD 1 0573 hes cell countar option (TYPCEL) , the 
number of rows in the cell matrix (NROW), ‘the number of 
columns in that matrix (NCOL), and th2 number of elements in 
line 3a (NTORD), all positive integers. Depending on the 
MyeGEL option, line За will contain either integers or 
reals. | 

(4) Line 4: the number of treatments +o be 
included in this test (NTREAT), a positive integer. Line da 
contains the indices of the variables of che data file which 
are included in the data set to be tested. 

(5) Line 5: thes number of biocks excluded from the 
test (NEXB). Lae 5a contains thes indices of excluded 
blocks. 

(6) Line 6: the number of integer parameters, a 
non-negative integer. If line 6 is greater than zero, line 
6a contains the integer parameters for the test. 

(7) Line 7: the number of real parameters, a non- 
negative integer. If line 7 is greater than zero, line 7a 
contains the real parameters for the test. 

(8) Line 8: data set nanz. Line 8a contains the 


names of data set variables. 
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C. EXAMPLE OF OPTIONS FILE FOR ORDINAL OR INTERVAL TEST 


Table 5 contains a representative data file, with the 
line numbers of the options file for the twenty tests which 
require ordinal or interval data. This example is for the 
test to perform the problem contained in Appendix A, i.e., 
problem 1, page 309 of Conover (ref 1]. In this example, 
three of four variables were used, and one block excluded. 
Line number are enclosed by < >. Comments are enclosed by 
(*-*) ; the comments refer to the names of variables, as 


mentioned in the previous section. 


<J 


| | 

| TABLE 5 | 

| Options | 

< 1> 1 11 *NIF=1,NO?=11*) | 

<2> 17 Q *NTEST =17,CELLS=0*) | 

<4> 3 *NTREA T=3 *) | 

| «ua» 1 y (*indices of treatments*) | 

| <5> 1 *NEXB*) | 

<5a> 3 *index of the block excluded*) | 

| <6> 0 *number of integsr parameters- О ж) | 
<7> 1 *number of reai parameters=1*) 

| <7а> 0.95 жгеаї parameter=).95*) | 

3 


o 
| 
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APPENDIX E 
THE PROGRAM CRUNCH 


A. PURPOSE 


This appendix is intended to assist implementing 
Crunch cn systems cther than tha IBM-3033 at the Naval 
Postgraduate School. It will also assist in modifying the 
Progran, саар ngor tions Of it “Or use in other 


programs. 


Bee DESCRIPTION OF THE PROGRAM STRUCTURE 


1. General 
While Lochinvar is designed to be used only as an 
integral unit, Crunch is separable. Therefor2, many things 
which are centralized in the main program of Lochinvar--such 
as dimensioning and variable declaration--are done in each 
subroutine of Crunch. Explanations and documentation are 
also at the subprogram level. 
2. Subprogram Grcups 
Crunch can be thought of as a system of five groups, 
each made up of several subroutines and functions. 
a. Main Program 
The main program, together with the subroutines 
Fname and Vname, form one group; they read the data set and 
variable names, call the correct data formation group, then 
call *he correct test subroutine, and remind the user which 
ENG contains the output. 
b. Data Formation Group for Cell Counts 
If the test involves calli cogit (Binomial, 
Quantile, McNemar, Test for Independance, Cochran), the data 
formation group Celler is called by the main brogram. Celler 


and the subroutines it calls produce the cell counts used as 
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mie taw Waterial for calculations in the test subroutine. 
Those cell counts are contained in the vector RCCT, which is 
dimensioned by the number of rows NROW and the number of 
columns NCOL. 
с. Data Formation Group for Data Values 

For all other tests, the master reader group is 
called. Mstrdr and its subroutines read the specified data 
file and the options file. They produce a list of the data 
elements which are relevant for the test,  i.e., MS ra 
includes only the data set variabiss indicated, and excludes 
any blocks or pairs specified. Thase data values are 
contained in the vector LIST. LIST is dimensioned by the 
pointer vector LPTR. LPTR (1) is always 1. LPTR(i) is the 
index of the first element of the ich variable in LIST. 
With k variables, LPTR(k+1) is one more than the lasx 
element of LIST. Ror eximple, 1 LISI contains three Vari- 


ables, with two, four and seven elements each, LPTR will 


have four elements, (15,7 ушу), In dereferencing variable 
values, the ith variable will have elements LIST(LPTR(i)) 
Enough LIST(LPTR(i*!)-1). NTREAT is the number of vari- 


ables included in the test. 
d. Utility Subroutines and Functions 
Many subroutines are called by more than one tes- 
subroutine. These versatile subroutines are grouped 
together. 
e. Test Subrcutine Groups 
Each of the 24 tests has a principle subroutine 
called by the main progran. In general, these subroutines 
perform the bulk of the computations for the test statistic 
or level of significance. However, some of the computation 
is done by calling other subroutines.  Subroutines which are 
called only by one test are listed contiguous to that test's 
subroutine. Subroutines which ar2 called by more than one 
meet ars listed in the utility subroutine group. 
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The test subroutines make use of either LIST or 
Peer in their computations. 
3. Language 
Crunch is written to be compatible with the IBM 


Fortran H compiler. Only the most rudimentary syntax was 
used. The only loop statements are the do for loops of the 
form, | 


DO 10 I=LL,UL 


where 10 is a line label for a continue statenerr, LL is the 


lower limit and UL is the upper limit of the loop. All 
other locps or branching is done with if statements. This 
very primitiva, unstructured Fortran should be compatible 


with virtually any computer system. 
ü. Input/Output Files 

The IBM file definitions are contained in the file 
ESHUSNCH EXEC". All files except 03 (terminal) are inplic- 
E fixed format, logical record length 80. Files 01 
eeeough 05 refer to "lochi a" through "lochi ="; file 06 is 
Ensobleh data"; file 07 is "lochi £"; file 10 is "lochi op"; 
BEES 11 through 16 are "u listing" through "z listing". 

The number of input file (NIF) and number of output 
file (NOF) are listed in line one əf the options file, and 
read by Crunch's main program. Paice ce Olt. si, 05,07» ¿andino 
are candidates for NIF; files 11,...,16 are candidates for 
NOF. File 08 is used only to remind the user of which output 
file contains the test result. File 06 contains system 
error messages, while file 09 contains intermediate computa- 
Ens of Crunch. 

The main program reads lines one and two of tha 
options file. Either Celler or Mstrdr read most of the Test 
of the data and options file.  Fname reads tne data set name 
and writes it to the output file; Vname does the sane for 


tne variable names. 
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The test subroutines write all of tne te 
Miles "u listing" through “z listing", except the cata set 


and variable names. 


5. Dimensions and Declarations 
At the beginning of the main progzan, and of 2ach 


subroutine, the variables used are declared and the arrays 
dimensioned. Ali declarations and dimensioning is explicit. 
Although arrays could be dimensioned by variables in subrou- 
emes, this is not done. For exanple, with TOTELM as the 
integer representing the total data 2lements of the test, 
Mest could be dimensioned in a test Subroutine as 
AS T (TO TELA) vic2 LIST (400). This was MOT donna; as 
explained above, Crunch is intended to ne separable. To 
change the capacity of the program, a global change command 
nay be issued, if the instailation has a good text editor 

Otherwise, the dimensions must be changed one by one. 

There are a very fəw arrays which consume inordinate 
amounts of storage fcr what information they provide. They 
have the dimension of 20,000; that dimension allows storage 
of a comparison cf each of 400 data sləmeats with sach other 
data element. On the IBM-3033, these large arrays require 
the user tc request 1 megabyte of storage. On smaller 
systems, these arrays may be redimensioned down to exactly 
the size necessary (if k=the number of elements, k(k-1)/2). 
An alternative is to eliminate them altogether, and take out 
test procedures which use then. 

Bee Dessrzptrion of Variables 

At the beginning of the program, there is a brief 
description of all variables and arrays which are used or 
passed by the main program. After the subprogran title 
line, з similar list defines “he variables used exclusively 


among subroutines of that subprogran. 
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Above the variable declarations in each test su 
tine, there is a description cf eaca of the integer and real 


parameters used in that test. 


i EXTERNAL SUPPORT 


The non-IMSL subroutine Shsort is essential to the oper- 
ШЕ Сп cf the program Crunch. Its documentation is available 
to any user at the Naval Postgraduate School. Any systen 
ОТТ routine can be used to support Crunch, which is 
currently configured to pass a real vector of values to be 
Escted, an integer vector (1,...,n) tO receive the order of 
the original data values, and the number of elements n. 

The non-IMSL Utest was used for the Mann-Whitney test. 
ts value is that it not only computes the test statistic, 
but also the level of significance. Molt i Gat tons to 
Múnich could easily allow computation of the test statistic. 
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MCNEMAR(0,2eFALSE, TRUE); 


END; 
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